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DERIVATIVES OF PNEUMOCOCCAL CHOLINE BINDING PROTEINS FOR VACCINES 


This application claims the benefit of U.S. Prov. 
Appl'n Serial No. 60/085,743, filed May 15, 1998 and U.S. 

Prov. Appl'n Serial NO 60/080,878, filed April 7, 1998. 

This invention relates generally to the field of 
bacterial antigens and their use, for example, as 
immunogenic agents in humans and animals to stimulate an 
immune response. More specifically, it relates to the 
vaccination of mammalian species with a polypeptide 
comprising an alpha helix- forming polypeptide obtained 
from a choline binding polypeptide as a mechanism for 
stimulating production of antibodies that protect the 
vaccine recipient against infection by pathogenic 

bacterial species. Further, the invention relates to 

antibodies and antagonists against such polypeptides 

diagnosis and passive immune therapy with 
respect to diagnosing and treating such pneumococcal 
infections . 

In a particular aspect, the present invention 

relates to the prevention and treatment of pneumonococcal 
infections such as infections of the middle ear, 

nasopharynx, lung and bronchial areas, blood, CSF, the 

like, that are caused by pneumonococcal bacteria. In this 
regard, certain types of Streptococcus pneumoniae are of 
particular interest . 

S. pneumoniae is a gram positive bacteria which is a 
major causative agent in invasive infections in animals 
and humans, such as sepsis, meningitis, otitis media anri 
lobar pneumonia (Tuomanen, et al. NEJM 322:1280-1284 
(1995)). As part of the infective process, pneumococci 
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readily bind to non- inflamed human epithelial cells of the 
upper and lower respiratory tract by binding to eukaryotic 
carbohydrates in a lectin-like manner (Cundell et al.. 
Micro. Path. 17:361-374 (1994)). Conversion to invasive 
pneumococcal infections for bound bacteria may involve the 
local generation of inflammatory factors which may 
activate the epithelial cells to change the number and 
type of receptors on their surface (Cundell, et al.. 
Nature, 377:435-438 (1995)). Apparently, one such 
receptor, platelet activating factor (PAF) is engaged by 
the pneumococcal bacteria and within a very short period 
of time (minutes) from the appearance of PAF, pneumococci 
exhibit strongly enhanced adherence and invasion of 
tissue. Certain soluble receptor analogs have been shown 
to prevent the progression of pneumococcal infections 
(Idanpaan-Heikkila et al., J. Inf . Dis., 176:704-712 
(1997) ) . 

A family of choline binding proteins (CBPs) , which 
are non- covalently bound to phosphorylcholine, are present 
on the surface of pneumococci and have a non-covalent 
association with teichoic acid or lipoteichoic acid. An 
example of such family is choline binding protein A 
(CbpA) , an approximately 75kD weight type of CBP which 
includes a unique N- terminal domain, a proline rich 
region, and a C- terminal domain comprised of multiple 20 
amino acid repeats responsible for binding to choline . A 
segment of the N- terminal portion of CbpA protein forms an 
alpha helix as part of its three-dimensional structure. 


Accordingly, it is an object of the present 
invention to provide a polypeptide having broad protection 
against pneumococcal infections . 

Definitions 

In order to facilitate understanding of the 
description below and the examples which follow certain 
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frequently occurring methods and/or terms will be 
described . 


" Plasmids " are designated by a lower case- p preceded 
and/or followed by capital letters and/or numbers . The 
starting plasmids herein are either commercially 
available, publicly available on an unrestricted basis, or 
can be constructed from available plasmids in accord with 
published procedures. In addition, equivalent plasmids to 
those described are known in the art and will be apparent 
to the ordinarily skilled artisan. 


"Digestion" of DNA refers to catalytic cleavage of 
the DNA with a restriction enzyme that acts only at 
c ®^tain sequences in the DNA. The various restriction 
enzymes used herein are commercially available «nH their 
reaction conditions, cofactors and other requirements were 
used as would be known to the ordinarily skilled artisan. 

For analytical purposes, typically 1 pg of plasmid or DNA 
fragment is used with about 2 units of enzyme in about 20 
pi of buffer solution. For the purpose of isolating DNA 
fragments for plasmid construction, typically 5 to 50 pg 
of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts 
for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hnnr at 37°C are 
ordinarily used, but may vary in accordance with the 
supplier's instructions. After digestion the reaction is 
electrophoresed directly on a polyacrylamide gel to 
isolate the desired fragment . 


Size separation of the cleaved fragments is 
performed using 8 percent polyacrylamide gel described by 
Goeddel, D. et al., Nucleic Acids Res., 8:4057 (1980). 


"Oligonucleotides" refers to 
stranded polydeoxynucleotide or 
polydeoxynucleotide strands which 


either a single 
two complementary 
may be chemically 
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synthesized . Such synthetic oligonucleotides have no 5' 
phosphate and thus will not ligate to another* 

oligonucleotide without adding a phosphate with an ATP in 
the presence of a kinase . A synthetic oligonucleotide 
will ligate to a fragment that has not been 

dephosphorylated . 

"Ligation” refers to the process of forming 

phosphodiester bonds between two double stranded nucleic 
acid fragments (Maniatis, T., et al., Id., p. 146). 
Unless otherwise provided, ligation may be accomplished 
using known buffers and conditions with 10 units to T4 DNA 

liga.se ("ligase") per 0.5 fig of approximately equimolar 
amounts of the DNA fragments to be ligated. 

"HPS portion" as used herein refers to an amino acid 
sequence as set forth in SEQ ID NO: 2 for a choline binding 
protein ("CBP" ) of a pneumococcal bacteria that may be 
located amino terminal with respect to the proline rich 
portion of the overall amino acid sequence for such CBP. 

The terms "identity", "% identity" or "percent 
identity" as utilized in this application refer to a 
calculation of differences between two contiguous 
sequences which have been aligned for "best fit" (to 
provide the largest number of aligned identical 
corresponding sequence elements, wherein elements are 
either nucleotides or amino acids) and all individual 
differences are considered as individual difference with 
respect to the identity. In this respect, all individual 
element gaps (caused by insertions and deletions with 
respect to an initial sequence ("reference sequence")) 
over the length of the reference sequence and individual 
substitutions of different elements (for individual 
elements of the reference sequence) are considered as 
individual differences in calculating the total number of 
differences between two sequences. Individual differences 
may be compared between two sequences where an initial 
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sequences (reference sequence) has been varied to obtain a 
variant sequence (comparative sequence) or where a new 
sequence (comparative sequence) is simply aligned and 
compared to such a reference sequence. When two aligned 
5 sequences are compared all of the individual gaps in BOTH 
sequences that are caused by the "best fit" alignment over 
the length of the reference sequence are considered 
individual differences for the purposes of identity. If an 
alignment exists which satisfies the stated niinimm 
10 identity, then a sequence has the stated minimum identity 
to the reference sequence. For example, the following is a 
hypothetical comparison of two sequences having 100 
elements each that are aligned for best fit wherein one 
sequence is regarded as the "reference sequence" and the 
15 other as the comparative sequence. All of the individual 
alignment gaps in both sequences are counted over the 
length of the reference sequence and added to the number 
of individual element substitution changes (aligned 
elements that are different) of the comparative sequence 
20 for the total number of element differences. The total 
number of differences (for example 7 gaps and 3 
substitutions) is divided by the total number of elements 
in the length of the reference sequence (100 elements) for 
the "percentage difference" (10/100) . The resulting 
25 percentage difference (10%) is subtracted from 100% 
identity to provide a "% identity" of 90% identity. For 
the identity calculation all individual differences in 
both sequences are considered in the above manner over a 
discrete comparison length (the length of the reference 
30 sequence) of two best fit aligned sequences to determine 
identity. Thus, no algorithm is necessary for such an 
identity calculation. 

"Isolated" in the context of the present invention 
35 with respect to polypeptides and/or polynucleotides means 
that the material is removed from its original environment 
(e.g. , the natural environment if it is naturally 
occurring) . For example, a naturally-occurring 

polynucleotide or polypeptide present in a living organism 
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is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the co-existing 
materials in the natural system, is isolated. Such 
polynucleotides could be part of a vector and/ or such 
5 polynucleotides or polypeptides could be part of a 
composition, and still be isolated in that such vector or 
composition is not part of its natural environment . The 
polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and 
10 preferably are purified to homogeneity. 

Summary of the Invention 

In one aspect the present invention relates to a 
vaccine for treating or preventing pneumococcal bacterial 
15 infections which utilizes as an immunogen at least one 

polypeptide truncate of a pneumococcal surf ace -binding 
protein, analog, or variant having a highly conserved 

immunogenic alpha-helical portion (corresponding generally 
to a "consensus" amino acid sequence as set forth in SEQ 
20 ID NO : 1 ) with respect to different types of pneumococcal 
bacteria , which polypept ide does not include a choline - 
binding portion. Preferably, the C- terminal choline- 
binding portion is absent from such polypeptides. More 
preferred are such polypeptides wherein the HPS amino acid 
25 sequence is also absent. Even further preferred are 

polypeptides wherein the highly conserved immunogenic 
alpha-helical portion corresponding generally to a 
" consensus " amino acid sequence as set forth in SEQ ID 
NO : 1 also corresponds generally to the amino acid sequence 
3 0 as set forth in SEQ ID NO: 19 (amino acids 1 to 103 of SEQ 

ID NO: 19 are identical to amino acids 1 to 103 of SEQ ID 

NO : 1) . Also preferred as vaccines are recomb inant ly- 
produced, isolated polypeptides that are missing both an 
HPS portion and the choline -binding portion. 

35 

More preferred as vaccines are one or more 
polypeptide truncates of pneumococcal surf ace -binding 
proteins, analogs or variants including a single highly 
conserved alpha-helix immunogenic portion with respect to 
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different types of pneumococci, which polypeptides do not 
include a C- terminal choline -binding portion. Further 
preferred are isolated recombinantly produced polypeptides 
having such structure . Also preferred are such 

5 polypeptides that do not include either a C- terminal 
choline-binding portion or a HPS portion. 


10 


15 


20 


25 


30 


The present invention further provides a vaccine 
comprising a polypeptide including an immunogenic portion 
that is capable of forming an alpha helix, which 

polypeptide includes a sequence that has at least 85% 
identity and preferably at least 87% identity to the amino 
acid sequence of SEQ ID NO:l, wherein the isolated 

polypeptide does not include a C-terminal choline-binding 
portion. Further preferred are such polypeptides that 

comprise a polypeptide sequence that has at least 85% 
identity and preferably at least 87% identity to an amino 
acid sequence according SEQ ID NO: 19. Preferably, the 
sequence of the isolated polypeptide includes neither an 
HPS portion (SEQ ID NO: 2) nor a C-terminal choline -binding 
portion. Further preferred are isolated recombinantly 

produced polypeptides having such structure. In 

particular ' such polypeptides corresponding to alpha 
helical structures of different types of s. pneumoniae 
bacteria are contemplated. Particularly preferred are the 
serotypes 1-5, 6A, 6B. 7F, 8, 9N, 9V, 10A, 11A, 12F, 14, 

15B, 17F > 18C, 19F, 19A, 20, 22F, 23F and 33F of such S. 

pneumoniae bacteria. Examples of such serotypes of 

bacteria are readily available from standard ATCC 
catalogs . 


In an additional aspect, the present invention 
further provides a vaccine against s. pneumoniae 
comprising a synthetic or recombinant polypeptide 
35 comprising a plurality of alpha-helical portions, each 
derived from different naturally occurring S. pneumoniae 
choline -binding polypeptides wherein such alpha-helical 
portions have at least 85% identity to the amino acid 
sequence of SEQ ID NO:l, and wherein the isolated 
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polypeptide does not include a choline -binding portion. 
Further preferred are those wherein the amino acid 
sequence for the alpha— helix areas is at least 85% 
identical to the amino acid sequence of SEQ ID NO: 19. 

5 Preferably, such synthetic polypeptide includes neither a 
HPS portion nor a choline -binding portion. Analogs and 
variants of such chain structure polypeptides wherein such 
alpha helical portions may be synthetic variant amino acid 
sequences (or may be a mixture of naturally occurring and 
10 variant sequences) are also contemplated and embraced by 
the present invention. In a preferred aspect, chain 
vaccines polypeptides having at least ten different alpha 
helical structures corresponding to S . pneumoniae 

serotypes 1-5, 6A, 6B, 7F, 8, 9N, 9V, 10A, 11A, 12F, 14, 

15 15B , 17F , 18C, 19F, 19A, 20, 22F, 23F and 33F are 

provided. Further preferred are polypeptides including at 
least fifteen of such alpha-helical structures, more 
preferred are polypeptides including at least 20 such 

alpha-helical structures and more preferred are 
20 polypeptides including at least one alpha-helical 
structure corresponding to each of the S . pneumoniae 
serotypes 1-5, 6A, 6B, 7F, 8, 9N, 9V, 10A, 11A, 12F, 14, 

15B, 17F, 18C, 19F, 19A, 20, 22F, 23F and 33F. Another 

preferred polypeptide comprises each of the alpha helical 
25 structures from the amino acid sequences of SEQ ID NOS: 3- 
18 which correspond to SEQ ID NO:l. 

In another aspect, the invention relates to passive 
immunity vaccines formulated from antibodies against a 
30 polypeptide including a highly conserved immunogenic 
portion with respect to different types of pneumococcal 
bacteria which portion is capable of forming an alpha- 
helix having the hereinbefore described identity to the 
amino acid sequence of SEQ ID NO:l, which polypeptide does 
35 not include a C- terminal choline -binding portion, wherein 
said antibodies will bind to at least one S. pneumoniae 
species. Preferably, if such polypeptide is a truncate of 
a native pneumococcal surf ace -binding protein both its HPS 
portion (where applicable) and its choline -binding portion 
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are absent from such polypeptide. Such passive immunity 
vaccines can be utilized to prevent and/or treat 
pneumococcal infections in immunocompromised patients, 
patients having an immature immune system (such as young 
5 children) or patients who already have an ongoing 

infection. In this manner, according to a further aspect 
of the invention, a vaccine can be produced from a 
synthetic or recombinant polypeptide wherein the 
polypeptide includes the conserved alpha helical portions 
10 of two or more different choline binding polypeptides of 
S. pneumoniae. 


This invention also relates generally to the use of 
an isolated polypeptide having a highly conserved 
15 immunogenic portion with respect to different types of 
pneumococcal bacteria which portion is capable of forming 
an alpha-helix (corresponding generally to SEQ ID NO:l or 
to SEQ ID NO: 19) wherein the isolated polypeptide does not 
include a choline -binding portion, to raise antibodies in 
20 non-human mammalian species useful, for example, as 
diagnostic reagents and vaccines . 

In yet another aspect, the present invention relates 
to the production of a polypeptide including a highly 
25 conserved immunogenic portion with respect to different 

types of pneumococcal bacteria which portion is capable of 
forming an alpha-helix whose sequence corresponds 
generally to the amino acid sequence of SEQ ID NO:l or SEQ 
ID NO: 19, wherein the isolated polypeptide does not 
30 include a choline-binding portion. Preferably, such 
recombinant production is of a truncated native 
pneumococcal surface -binding polypeptide wherein both the 
HPS portion (where applicable) and the choline-binding 
portion are absent . 

35 

In still another aspect, the present invention 
provides an isolated choline -binding polypeptide, wherein 
the non- choline binding region of such polypeptide has at 
least 90% identity to the corresponding amino acid 
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sequence portion of a naturally occurring pneumococcal 
surf ace -binding protein which is a member selected from 
the group consisting of SEQ ID NOS: 3-18. The invention 
relates to fragments of such polypeptides which include at 
5 least the conserved alpha- helical portion corresponding 
generally to SEQ ID NO:l, and which has at least 85% 
identity thereto, wherein the isolated polypeptide 
preferably is free of a choline binding region. 

]_0 In another aspect the present invention provides an 

isolated polypeptide comprising an amino acid sequence 
which has at least 90% identity to one of the amino acid 
sequences selected from the group consisting of SEQ ID 
NO: 3-18. Preferably, such isolated polypeptide comprises 
15 an amino acid sequence which has at least 95% identity , 
and more preferably 97 % identity, to one of the amino acid 
sequences selected from the group consisting of SEQ ID 
NO:3-18. The invention further relates to fragments of 
such polypeptides. 

20 

In a yet further aspect, the present invention 
provides a S. pneumoniae CBP polypeptide encoded by a 
polynucleotide that will hybridize under highly stringent 
conditions to the complement of a polynucleotide encoding 
25 a polypeptide having an amino acid selected from the group 
consisting of SEQ ID NOS:l and 3-18. Particularly 

preferred are polypeptides comprising an amino acid 
sequence segment that is at least 90% identical to the 
amino acid sequence of SEQ ID N0:1. Further preferred are 
3 0 such polypeptides comprising a contiguous amino acid 
sequence that has at least 95% identity with respect to 
the amino acid sequence of SEQ ID NO : 1 . And, even more 
preferred are polypeptides comprising an amino acid 
sequence that has at least 97% identity with respect to 
3 5 the amino acid sequence of SEQ ID NO:l. 

In another aspect the present invention provides 
polynucleotides which encode the hereinabove described 
polypeptides of the invention. The polynucleotide of the 


10 
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present invention may be in the form of RNA or in the form 
of DNA., which DNA includes cDNA, genomic DNA, anri 
synthetic DNA. The DNA may be double- stranded or single- 
stranded, and if single stranded may be the coding strand 
5 or non-coding (anti-sense) strand. The polynucleotides 
which encode polypeptides including the amino acid 
sequences of at least one of SEQ ID NOS:3-i8 (or 
polypeptides that have at least 90% identity to the amino 
acxd sequences of such polypeptides) may be one of the 
10 coding sequences shown in SEQ ID NOS: 20-35 or may be of a 
^iff erent coding sequence which coding sequence, as a 
result of the redundancy or degeneracy of the genetic 
code, encodes the same polypeptides as the DNA of SEQ ID 
NOS : 2 0 - 3 5 . 

15 

The polynucleotides which encode the polypeptides of 
SEQ ID NOS: 3-18 may include: only the coding sequence for 

the polypeptide; the coding sequence for the polypeptide 
(and optionally additional coding sequence) and non-coding 
20 sequence, such as introns or non-coding sequence 5' and/or 
3 of the coding sequence for the polypeptide . The 
polypeptides encoded may comprise just a single alpha- 
helical portion or multiple alpha-helical portion an<i may 
independently or collectively include N-terminal sequences 
25 5' of such alpha helical areas and/or sequences 

corresponding to the "X" structures or proline rich areas 
(as set forth in Figure 1, for example) . 


30 


35 


The invention further relates to a polynucleotide 
comprising a polynucleotide seqeunce that has at least 95% 
identity and preferably at least 97% identity to a 
polynucleotide encoding one of the polypeptides comprising 
SEQ ID NO: 3 -18. The invention further relates to 
fragments of such polynucleotides which include at least 
the portion of the polynucleotide encoding the polypeptide 
sequence corresponding to SEQ ID NO:l. 


Thus, the term "polynucleotide encoding a 
polypeptide" encompasses a polynucleotide which includes 
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only coding sequence for the polypeptide as well as a 
polynucleotide which includes additional coding and/or 
non- coding sequence. In particular, the polypeptides may 
include any or all of the types of structures set forth 
5 schematically in Figure 1 . 

The present invention further relates to variants of 
the hereinabove described polynucleotides which encode for 
fragments, analogs and derivatives of the polypeptides 
10 including the amino acid sequences of SEQ ID NOS: 3-18. 

The variants of the polynucleotides may be a naturally 
occurring allelic variant of the polynucleotides or a non- 
naturally occurring variant of the polynucleotides . 
Complements to such coding polynucleotides may be utilized 
15 to isolate polynucleotides encoding the same or similar 
polypeptides. In particular, such procedures are useful 
to obtain alpha helical coding segments from different 
serotypes of S. pneumoniae, which is especially useful in 
the production of 11 chain” polypeptide vaccines containing 
20 multiple alpha helical segments . 

Thus , the present invention includes polynucleotides 
encoding polypeptides including the same polypeptides as 
shown in the Sequence Listing as SEQ ID NOS : 3-18 as well 
25 as variants of such polynucleotides which variants encode 
for a fragment, derivative or analog of the polypeptides 
of SEQ ID NOS : 3 - 18 . Such nucleotide variants include 
deletion variants, substitution variants and addition or 
insertion variants. 

30 As hereinabove indicated, the polynucleotides may 

have a coding sequence which is a naturally occurring 
allelic variant of the coding sequence shown in the 
Sequence. Listing as SEQ ID NOS: 20-35. As known in the 
art, an allelic variant is an alternate form of a 
3 5 polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which 
does not substantially alter the function of the encoded 
polypeptide . 


12 
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The polynucleotides of the present invention may 
also have the coding sequence fused in frame to a marker 
sequence which allows for purification of the polypeptides 
of the present invention. The marker sequence may be, for 
5 example, a hexa-histidine tag supplied by a pQE-9 vector 
to provide for purification of the mature polypeptides 
fused to the marker in the case of a bacterial host, or, 
for example, the marker sequence may be a hemagglutinin 
(HA) tag when a mammalian host, e.g. COS-7 cells, is used. 

0 The HA tag corresponds to an epitope derived from the 

influenza hemagglutinin protein (Wilson, I., et al.. Cell, 

37:767 (1984) ) . 


The present invention further relates to 
15 polynucleotides (hybridization target sequences) which 
hybridize to the complements of the hereinabove -described 
sequences if there is at least 70% and preferably 80% 
identity between the target sequence and the complement of 
the sequence to which the target sequence hybridizes, 
20 preferably at least 85% identity. More preferred are such 
sequences having at least 90% identity, preferably at 
least 95% and more preferably at least 97% identity 
between the target sequence and the sequence of complement 
of the polynucleotide to which it hybridizes. The 

25 invention further relates to the complements to both the 
target sequence and to the polynucleotide sequence that 
encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOS: 3 to 18. The present invention 
particularly relates to polynucleotides which hybridize 
30 under stringent conditions to the complements of the 
hereinabove -described polynucleotides as well as to those 
complements . As herein used, the term "stringent 

conditions" means hybridization will occur with the 
complement of a polynucleotide and a corresponding 
35 sequence only if there is at least 95% and preferably at 
least 97% identity between the target sequence anH the 
sequence of complement of the polynucleotide to which it 
hybridizes . The polynucleotides which hybridize to the 
complements of the hereinabove described polynucleotides 
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in a preferred embodiment encode polypeptides which retain 
an immunogenic portion that will cross -react with an 
antibody to at least one of the polypeptides having a 
sequence according to SEQ ID NOS: 3-18, or to a polypeptide 
5 that includes an amino acid sequence which has at least 
85% identity to that of SEQ ID NO:l. 

In a still further aspect, the present invention 
provides for the production of such polypeptides and 
10 vaccines as set forth above having a histidine label (or 
other suitable label) such that the full-length proteins, 
truncates , analogs or variant discussed above can be 
isolated due to their label . 

15 In another aspect the present invention relates to a 

method of prophylaxis and/or treatment of diseases that 
are mediated by pneumococcal bacteria that have surface- 
binding CBP proteins. In particular, the invention 
relates to a method for the prophylaxis and/or treatment 
2 0 of infectious diseases that are mediated by S. pneumoniae 
that have a CBP surface -binding protein that forms an 
alpha helix (comprising a sequence that has at least an 
85% identity to the amino acid sequence of SEQ ID NO:l) . 
In a still further preferred aspect, the invention relates 
25 to a method for the prophylaxis and/or treatment of such 
infections in humans . 

In still another aspect the present invention 
relates to a method of using one or more antibodies 
30 (monoclonal, polyclonal or sera) to the polypeptides of 
the invention as described above for the prophylaxis 
and/or treatment of diseases that are mediated by 
pneumococcal bacteria that have CBP surf ace -binding 
proteins. In particular, the invention relates to a 
35 method for the prophylaxis and/or treatment of infectious 
diseases that are mediated by S. pneumoniae CBP proteins 
which include an alpha helical portion having the 
hereinbefore described identity to the consensus sequence 
of SEQ ID NO:l. In a still further preferred aspect, the 
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invention relates to a method for the prophylaxis and/or 
treatment of otitis media, nasopharyngeal, bronchial 
infections, and the like in humans by utilizing antibodies 
to the alpha-helix containing immunogenic polypeptides of 
5 the invention as described above . 


15 
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Brief Description of Drawings 

Figure 1 is a diagram of a pneumococcal CBP protein 
which shows from the N- terminal to the C- terminal, 
5 respectively, (a) a N- terminal sequence, (b) one of a 
potential alpha-helical forming area conserved segment 
(Rl) that may not be present in some CBP polypeptides, (c) 
an optional small bridging sequence of amino acids that 
may bridge two conserved alpha-helical segments (X) , (d) a 

10 second of a potential alpha-helical forming area consensus 
sequence (R2) related to the first consensus sequence 
(which corresponds to SEQ ID NO:l), (e) a proline rich 
area sequence, (f) a choline binding repeats area, and 
(e) a C- terminal tail sequence. Where relevant, an 
15 optional HPS sequence may naturally occur 5' of the 
proline rich sequence and 3' of the Rl, X, and/or R2 
areas . 

Figure 2 reports the results for passive immunity 
20 protection against 1600 cfu virulent serotype 6B 

S. pneumoniae SP317 (in mice) that was provided by day 31 
rabbit antisera to a pneumococcal CBP truncate 

polypeptide, NR1XR2 (truncate missing both the proline and 
the choline binding areas , but including two conserved 

25 alpha-helical areas Rl and R2) . Eighty percent of the 

mice immunized with the truncate antisera prior to 
challenge survived the 14 day observation period. By 
contrast, all mice immunized with a control sera (pre- 
immune rabbit sera) were dead by day 7 . 

30 

Figure 3 reports the results for passive immunity 
protection against 3450 cfu virulent serotype 6B 

S. pneumoniae SP317 (in mice) that was provided by day 52 
rabbit antisera to a pneumococcal CBP truncate 
35 polypeptide, NR1XR2 (truncate missing both the proline and 
the choline binding areas, but including two conserved 

alpha-helical areas Rl and R2) . One hundred percent of 
the mice immunized with the truncate antisera prior to 
challenge survived the 10 day observation period. By 


_9951266A2_L> 


r6 



WO 99/51266 


PCT/US99/07680 


BNSDOCID: <WO. 


contrast, ninety percent of the mice immunized with a 
control sera (pre-immune rabbit sera) were dead at day 10. 

Figure 4 reports the results for passive immunity 
5 protection against 580 cfu virulent serotype 6B 
S. pneumoniae SPSJ2 (in mice) that was provided by day 31 
rabbit antisera to a pneumococcal CBP truncate 
polypeptide, NR1XR2 (truncate missing both the proline and 
the choline binding areas, but including two conserved 
10 alpha-helical areas Rl and R2) . Fifty percent of the mice 
immunized with the truncate antisera prior to challenge 
survived the 10 day observation period. By contrast, all 
mice immunized with a control sera (pre-immune rabbit 
sera) were dead by day 8 . 

15 

Figure 5 reports the results for active immunity 
protection against 560 cfu virulent serotype 6B 

S. pneumoniae SPSJ2 (in mice) that was provided by 

immunization with a pneumococcal CBP truncate polypeptide, 
20 NR1X (truncate missing the second conserved alpha-helical 
area R2, as well as both the proline and the choline 
binding areas) . Eighty percent of the mice actively 
immunized with the NR1X CBP truncate prior to challenge 
survived the 14 day observation period. By contrast, all 
25 mice immunized with a control (sham mice) of PBS and 
adjuvant were dead by day 8. 

Figure 6 reports the results for active immunity 
protection against 680 cfu virulent serotype 6B 

30 S . pneumoniae SPSJ2 (in mice) that was provided by 

immunization with a pneumococcal CBP truncate polypeptide, 
NR1XR2 (truncate missing both the proline and the choline 
binding areas, but including two conserved alpha-helical 
areas Rl and R2) . Fifty percent of the mice actively 
35 immunized with the NR1XR2 CBP truncate prior to challenge 
survived the 14 day observation period. By contrast, all 
mice immunized with a control (SP90) protein and adjuvant 
were dead by day 9 . 
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15 

Figure 9 is an alignment report for a first helical 
region in the amino acid sequences of CBP polypeptides 
20 from various types of S. pneumoniae and a consensus 

sequence is reported at the top of each row (sets of 
lines) of the comparison. The consensus sequence for the 
comparison is listed as the "Majority sequence (SEQ ID 
UO:38) . One letter codes are utilized to represent the 
25 sequences which are aligned for a "best fit" comparison 

wherein dashes in a sequence indicate spacing gaps of the 
contiguous sequence. 

Figure 10 shows the sequence pair distances for the 

3 0 amino acid sequences as described for Figure 9 and set 

forth therein. A Clustal method with identity residue 
weight table is used. The percent similarity for such a 
- comparison is reported for the amino acid sequences set 
forth in Figure 9 . 

35 

Figure 11 is an alignment report for the region X in 
the amino acid sequences of CBP polypeptides from various 
types of S. pneumoniae and a consensus sequence is 
reported at the top of each row (sets of lines) of the 
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Figure 7 is an alignment report of the ammo 
terminus of CBP polypeptides from various types of S. 
pneumoniae and a consensus sequence is reported at the top 
of each row (sets of lines) of the comparison. The 
consensus sequence for the comparison is listed as the 
"Majority" sequence (SEQ ID NO: 36) . One letter codes are 
utilized to represent the sequences which are aligned for 
a "best fit" comparison wherein dashes in a sequence 
indicate spacing gaps of the contiguous sequence. 

Figure 8 shows the sequence pair distances for the 
amino acid sequences as described for Figure 7 and set 
forth therein. A Clustal method with identity residue 
weight table is used. The percent similarity for such a 
comparison is reported for the amino acid sequences set 
forth in Figure 7 . 
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comparison. The consensus sequence for the comparison is 
listed as the “Majority" sequence (SEQ ID MO:37) . One 
letter codes are utilised to represent the sequences which 
are aligned for a "best fit- comparison wherein dashes in 

5 a sequence indicate spacing gaps of the contiguous 
sequence . 


Figure 12 shows the sequence pair distances for the 
amino acid sequences as described for Figure 11 and set 
10 forth therein. A Clustal method with identity residue 
weight table is used. The percent similarity for such a 

comparison is reported for the amino acid sequences set 
forth in Figure 11. 

Figure 13 is an alignment report for the second 
helical region A in the amino acid sequences of CBP 
polypeptides from various types of s. pneumoniae and a 
consensus sequence is reported at the top of each row 
(sets of lines) of the comparison. The consensus sequence 
for the conparison is listed as the "Majority" sequence 
(SEQ id NO:l). One letter codes are utilized to represent 
the sequences which are aligned for a "best fit" 
comparison wherein dashes in a sequence indicate spacing 
gaps of the contiguous sequence « 

Figure 14 shows the sequence pair distances for the 
amino acid sequences as described for Figure 13 and set 
forth therein a Clustal method with identity residue 
weight table is used. The percent similarity for such a 

comparison is reported for the amino acid sequences set 
forth in Figure 13 . 

Figure 15 is an alignment report for the second 
helical region B in the amino acid sequences of CBP 
35 polypeptides from various types of 5. pneumoniae and a 
consensus sequence is reported at the top of each row 
(sets of lines) of the comparison. The consensus sequence 
for the comparison is listed as the "Majority" sequence 
(SEQ ID NO: 19). one letter codes are utilized to 
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25 


30 
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represent the sequences which are aligned for a "best fit" 
comparison wherein dashes in a sequence indicate spacing 
gaps of the contiguous sequence . 

5 Figure 16 shows the sequence pair distances for the 

amino acid sequences, as described for Figure 15 and set 
forth therein. A Clustal method with identity residue 
weight table is used. The percent similarity for such a 
comparison is reported for the amino acid sequences set 
10 forth in Figure 15. 


Detailed Description of the Invention 

j _5 Xn accordance with an aspect of the present 

invention there is provided a vaccine to produce a 
protective response against S. pneumoniae infections which 
employs a polypeptide which comprises a member selected 
from the group consisting of : 

2o (a) an amino acid sequence which produces an alpha 

helical structure and which is at least 85% identical to 
the amino acid sequence of SEQ ID NO : 1 and which is free 

of a choline binding region, and 

(b) an isolated truncate of a naturally occurring 
25 S. pneumoniae polypeptide that comprises an alpha helical 

portion that has at least 85% identity to the amino acid 
sequence of SEQ ID NO:l and is free of a choline binding 
region, 

(c) an isolated truncate of a naturally occurring 
30 S. pneumoniae polypeptide that comprises an alpha helical 

portion that has at least 90% identity to the amino acid 
sequence of SEQ ID NO: 19 and is free of a choline binding 
region. In a preferred aspect, such isolated truncate 
polypeptide is a member selected from the group consisting 
35 of SEQ ID NOS: 3-18 and said isolated polypeptide is free 
of a choline binding region and, if relevant, a HPS 
region; or a fragment thereof which includes at least the 
alpha helical segment which corresponds to the consensus 
sequence of SEQ ID NO:l. Particularly preferred are 
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vaccines which utilize such truncate polypeptides that 
include at least such alpha helical area or utilize a 
recombinant immunogen polypeptide comprising at least two 
such alpha-helical segments . Such polypeptide may be 
a recombinant polypeptide containing multiple alpha- 
helical areas from one or more trucates. Further 
P re ^ erre ^ are recombinant immunogen polypeptides 
comprising at least two alpha- helical areas corresponding 
^" 0 alpha helical areas of two or more truncates from 

different types of pneumococcal bacteria. Such 

polypeptide may be a recombinant polypeptide containing 
multi P le alpha- helical areas from one or more different 
types of pneumococcal bacteria.. 


^ I n accordance with the present invention, there is 

provided an isolated polypeptide comprising a truncated 
surface -binding polypeptide derived from S . pneumoniae, 
said isolated polypeptide containing an alpha-helical area 
whose amino acid sequence corresponds generally to the 
2 0 amino acid sequence of SEQ ID NO:l, but free of a choline 
binding area. Preferably, said isolated polypeptide also 
omits any naturally occurring repeats of the alpha-helical 
forming area and omits any HPS amino acid sequence that 
may be present . 

25 

It is an object of the present invention to utilize 
as immunogenic composition for a vaccine (or to produce 
antibodies for use as a diagnostic or as a passive 
vaccine) comprising an immunogenic polypeptide comprising 
30 a pneumococcal surface-binding polypeptide with an alpha 
helical portion from which a choline binding region has 
been omitted. In one embodiment, such truncated proteins 
(naturally or recombiantly produced, as well as functional 
ana l°9 s ) from S . pneumoniae bacteria are contemplated . 

35 Even more particularly, S. pneumoniae polypeptides having 
a single alpha helical portion that omit any HPS areas 
that occur and choline binding areas of the native protein 
are contemplated . 
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A particularly preferred embodiment of such an 
immunogenic composition is for use as a vaccine (or as an 
immunogen for producing antibodies useful for diagnostics 
or vaccines) wherein the active component of the 
5 immunogenic composition is an isolated polypeptide 
comprising at least one member selected from the group 
consisting of : 

(a) an amino acid sequence which is selected from 
SEQ ID NOS: 3-19, 

3_0 (b) a polypeptide which has at least 90% identity 

to (a) , preferably at least 95% identity to (a) , and even 
more preferred at least 97% identity to (a) , or 

(c) a fragment of (a) or (b) wherein such fragment 
includes at least one alpha helical portion that 
15 corresponds to the consensus sequence which is SEQ ID NO:l 
arid said fragment does not comprise a choline binding 
region. Preferably, such vaccines utilize a polypeptide 
that contains neither a choline binding region nor an HPS 
region that occurs as part of the amino acid sequences in 
20 the native proteins. 

In another preferred embodiment , there is provided a 
vaccine which includes at least one isolated polypeptide 
which includes an amino acid sequence which has at least 
25 85% identity (preferably 87% identity and more preferably 

at least 90% identity) to SEQ ID NO:l, which isolated 
polypeptide is free of a choline binding portion and, 
where applicable, is also preferably free of an HPS 
portion. The preferred polypeptide may also include one 
30 or more of the N- terminal sequences that are located 5' of 
the alpha helical areas in the polypeptides having an 
amino acid sequence selected from the group consisting of 
SEQ ID NOS: 3-18, or the like. The polypeptide truncate 
may also include one or more of the proline regions 
35 (region "P n in Figure 1) and/or the spanning region 
(region "X" in Figure 1) . 

In another aspect of the invention, such an 
immunogenic composition may be utilized to produce 
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antibodies to diagnose pneumococcal infections, or to 
produce vaccines for prophylaxis and/or treatment of such 
pneumococcal infections as well as booster vaccines to 
maintain a high titer of antibodies against the 
5 immunogen (s) of the immunogenic composition. 

While other antigens have been contemplated to 
produce antibodies for diagnosis and for the prophylaxis 
and/or treatment of pneumococcal infections, there is a 
10 need for improved or more efficient vaccines. such 
vaccines should have an improved or enhanced effect in 
preventing bacterial infections mediated pneumococci 
having surface -binding polypeptides. Further, to avoid 
unnecessary expense and provide broad protection against a 
15 range of pneumococcal serotypes there is a need for 
polypeptides that comprise an immunogenic amino acid 
sequence corresponding to a portion of pneumococcal 
surface -binding polypeptides that is a highly conserved 
portion among various types of pneumococci. Preferably, 

20 such polypeptides avoid the inclusion of amino acid 
sequences corresponding to other portions of the native 

polypeptides, such as the choline binding region and/or 
the HPS region. 


There is a need for improved antigenic compositions 
comprising highly conserved portions of polypeptides that 
bind to the surface of pneumococcal bacteria for 
stimulating high-titer specific antisera to provide 
protection against infection by pathogenic pneumococcal 
30 bacteria and also for use as diagnostic reagents. 

in such respect, truncated polypeptides, functional 
ariant analogs , and recombinantly produced truncated 
polypeptides of the invention are useful as immunogens for 
35 preparing vaccine compositions that stimulate the 
production of antibodies that can confer immunity against 
pathogenic species of bacteria. Further, preparation of 
vaccines containing purified proteins as antigenic 
ingredients are well known in the art. 
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Generally, vaccines are prepared as injectables, in 
the form of aqueous solutions or suspensions. Vaccines in 
an oil base are also well known such as for inhaling. 

5 solid forms which are dissolved or suspended prior to use 
may also be formulated. Pharmaceutical carriers are 
generally added that are compatible with the active 
ingredients and acceptable for pharmaceutical use. 
Examples of such carriers include, but are not limited to, 

10 water, saline solutions, dextrose, or glycerol. 
Combinations of carriers may also be used. 

Vaccine compositions may further incorporate 
additional substances to stabilize pH, or to function_ as 
15 adjuvants, wetting agents, or emulsifying agents, which 
can serve to improve the effectiveness of the vaccine. 

Vaccines are generally formulated for parenteral 
administration and are injected either subcutaneously or 
20 intramuscularly. Such vaccines can also be formulated as 
suppositories or for oral administration, using methods 

known in the art. 

The amount of vaccine sufficient to confer immunity 
25 to pathogenic bacteria is determined by methods well known 
to those skilled in the art. This quantity will be 
determined based upon the characteristics of the vaccine 
recipient and the level of immunity required. Typically, 
the amount of vaccine to be administered will be 
30 determined based upon the judgment of a skilled physician. 

Where vaccines are administered by subcutaneous or 
intramuscular injection, a range of 50 to 500 pg purified 
protein may be given. 

35 The term "patient in need thereof" refers to a human 

that is infected with, or likely, to be infected with, 
pathogenic pneumococcal bacteria that produce CbpA, or the 
like, preferably S. pneumoniae bacteria (however a mouse 
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model can be utilized to simulate such a patient in some 
circumstances ) . 

In addition to use as vaccines, the polypeptides of 
5 the present invention can be used as immunogens to 
stimulate the production of antibodies for use in passive 
immunotherapy, for use as diagnostic reagents, and for use 
as reagents in other processes such as affinity 
chromatography . 

10 

The polynucleotides encoding the immunogenic 
polypeptides described above may also have the coding 
sequence fused in frame to a marker sequence which allows 
for purification of the polypeptides of the present 
15 invention. The marker sequence may be, for example, a 
hexa- histidine tag supplied by a pQE-9 vector to provide 
for purification of the mature polypeptides fused to the 
. marker in the case of a bacterial host, or, for example, 
the marker sequence may be a hemagglutinin (HA) tag when a 
20 mammalian host, e.g. COS-7 cells, is used. The HA tag 
corresponds to an epitope derived from the influenza 
hemagglutinin protein (Wilson, I., et al.. Cell, 37:767 
(1984) ) . 

25 The identification of multiple coil structures of 

alpha helical amino acid segments in the S. pneumoniae 
polypeptides according to the invention may be determined 
by the location of proline rich areas with respect to one 
smother. Further the "X" area optionally located between 
30 two or more alpha-helical structures can play a part in 
the formation of a coil within a coil structure . Standard 
three-dimensional protein modeling may be utilized for 
• determining the relative shape of such structures . An 
example of a computer program, the Paircoil Scoring Form 
35 Program ("PairCoil program"), useful for such three- 
dimensional protein modelling is described by Berger et 
al. in the Proc. Natl. Acad, of Sci. (USA), 92:8259-8263 
(August 1995) . The PairCoil program is described as a 
computer program that utilizes a mathematical algorithm to 
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predict locations of coiled- coil regions in amino acid 
sequences . A further example of such a computer program 
is described by Wolf et al . , Protein Science 6:1179-1189 
(June 1997) which is called the Multicoil Scoring Form 
5 Program ("Multicoil program"). The MultiCoil program is 
based on the PairCoil algorithm and is useful for locating 
dimeric and trimeric coiled coils . 

In a preferred aspect , the invention provides for 
10 recombinant production of such polypeptides in a host 
bacterial cell other than a S. pneumoniae species host to 
avoid the inclusion of native surface-binding polypeptides 
that have a choline binding region. A preferred host is a 
species of such bacteria that can be cultured under 
15 conditions such that the polypeptide of the invention is 
secreted from the cell . 

The present invention also relates to vectors which 
include polynucleotides encoding one or more of the 
20 polypeptides of the invention that include the highly 
conserved alpha-helical amino acid sequence in the absence 
of an area encoding a choline binding amino acid sequence, 
host cells which are genetically engineered with vectors 
of the invention and the production of such immunogenic 
25 polypeptides by recombinant techniques in an isolated and 
substantially immunogenically pure form. 

Host cells are genetically engineered (transduced or 
transformed or transfected) with the vectors comprising a 
30 polynucleotide encoding a polypeptide comprising the 
highly conserved alpha-helical region but not having a 
choline binding region, or the like of this invention 
which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the 
35 form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating 
promoters, selecting transformants or amplifying the 
polynucleotides which encode such polypeptides. The 
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culture conditions, such as temperature, pH an h the like, 
are those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled 
artisan. 

5 

Vectors include chromosomal , nonchromosomal and 
synthetic DNA sequences, e.g., derivatives of SV40; 
bacterial plasmids; phage DNA; baculovirus; yeast 

plasmids; vectors derived from combinations of plasmids 
10 and phage DNA, viral DNA such as vaccinia, adenovirus, 
fowl pox virus, and pseudorabies. However, any other 

vector may be used as long as it is replicable anrf viable 
in the host . 


15 The appropriate DNA sequence may be inserted into 

the vector by a variety of procedures. In general, the 
DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of 
20 those skilled in the art. 

The DNA sequence in the expression vector is 
operatively linked to an appropriate expression control 
sequence (s) (promoter) to direct mRNA synthesis. As 
25 representative examples of such promoters, there may be 
mentioned: LTR or SV40 promoter, the E . coli , lac or trp , 
the phage lambda P L promoter and other promoters known to 
control expression of genes in prokaryotic or eukaryotic 
cells or their viruses. The expression vector also 
30 contains a ribosome binding site for translation 
initiation and a transcription terminator. The vector may 
also include appropriate sequences for amplifying 
expression. 


In addition, the expression vectors preferably 
contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells 
such as dihydrofolate reductase or neomycin resistance for 
eukaryotic cell culture, or such as tetracycline or ampi- 
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cillin resistance in E. coli . 

The vector containing the appropriate DNA sequence 
as hereinabove described, as well as an appropriate 
5 promoter or control sequence, may be employed to transform 
an appropriate host to permit the host to express the 
proteins. 

As representative examples of appropriate hosts, 
10 there may be mentioned: bacterial cells, such as E. coli , 
Streptomyces , Salmonella typhimur ium ; fungal cells, such 
as yeast ; insect cells such as Drosophila S2 and 
Spodoptera Sf9; animal cells such as CHO, COS or Bowes 
melanoma; adenoviruses; plant cells, etc. The selection 
15 of an appropriate host is deemed” "to be within the scope of 
those skilled in the art from the teachings herein. 

More particularly, the present invention also 
includes recombinant constructs comprising one or more of 
2 0 the sequences as broadly described above . The constructs 

comprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a 
forward or reverse orientation. In a preferred aspect of 
this embodiment, the construct further comprises 

2 5 regulatory sequences , including , for example , a promoter , 
operably linked to the sequence . Large numbers of 

suitable vectors and promoters are known to those of skill 
in the art, and are commercially available. The following 
vectors are provided by way of example. Bacterial: pQE70, 
30 pQE6 0 , pQE-9 (Qiagen, Inc. ) , pbs, pDIO, phagescript, 
psiX174, pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, 
pNH4 6 A (Stratagene) ; ptrc99a, pKK223-3, pKK233-3, pDR540, 

pRIT5 (Pharmacia) . Eukaryotic: pWLNEO, pSV2CAT, pOG44, 

pXTl , pSG ( Stratagene ) pSVK3 , pBPV , pMSG , pSVL 

35 (Pharmacia) . However, any other plasmid or vector may be 
used as long as they are replicable and viable in the 
host . 

Promoter regions can be selected from any desired 
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10 


15 


20 


25 


30 


35 


gene using cat ( chloramphenicol transferase) vectors or 
other vectors with selectable markers. Two appropriate 
vectors are PKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda P p 
and TRP. Eukaryotic promoters include CMV Mediate 
early, HSV thymidine kinase, early and late SV40 LTRs 
from retrovirus, and mouse metallothionein- 1 . Selection 

of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

in a further embodiment. the present invention 

relates to host cells containing , the above-described 
constructs The host cell can be a higher eukaryotic 
ell, such as a mammalian cell, or a lower eukaryotic 
cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. introduction 
of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-Dextran mediated 
ransfection, or electroporation (Davis, L. , Dibner, M., 
Battey, I., Basic Methods in Molecular Biology, (1986)). 

The constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
he recombinant sequence. Alternatively, the polypeptides 

e invention can be synthetically produced by 
conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, 
yeast, bacteria, or other cells under the control of 
appropriate promoters. Cell-free translation systems can 
also be employed to produce such proteins using RHAs 
enved from the DMA constructs of the present invention. 
Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described by 
ambrook. et al., Molecular cloning, A Laboratory Manual. 

con Edition, cold Spring Harbor, H.Y., (1989), the 

isclosure of which is hereby incorporated by reference. 

Transcription of the DMA encoding the polypeptides 
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of the present invention by higher eukaryotes is increased 
by inserting an enhancer sequence into the vector. 
Enhancers are cis -acting elements of DNA, usually about 
from 10 to 3 00 bp that act on a promoter to increase its 
5 transcription. Examples including the SV40 enhancer on the 
late side of the replication origin bp 100 to 270, a 
cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and 
adenovirus enhancers . 

10 

Generally, recombinant expression vectors will 
include origins of replication and selectable markers 
permitting transformation of the host cell, e.g., the 
ampicillin resistance gene of E. coli and S^ — cerevisiae 
15 TRP1 gene, and a promoter derived from a highly-expressed 
gene to direct transcription of a downstream structural 
sequence . Such promoters can be derived from operons 

encoding glycolytic enzymes such as 3 -phosphoglycerate 
kinase (PGK) , a- factor, acid phosphatase, or heat shock 
20 proteins, among others. The heterologous structural 
sequence is assembled in appropriate phase with 

translation initiation and termination sequences. 
Optionally, the heterologous sequence can encode a fusion 
protein including an N- terminal identification peptide 
25 imparting desired characteristics, e.g., stabilization or 
simplified purification of expressed recombinant product. 


Useful expression vectors for bacterial use are 
constructed by inserting a structural DNA sequence 
30 encoding a desired protein together with suitable 
translation initiation and termination signals in operable 
reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an 
origin of replication to ensure maintenance of the vector 
35 gT to, if desirable, provide amplification within the 

host. Suitable prokaryotic hosts for transformation 
include E . coli , Bacillus subtilis . Salmonella typhimurium 
and various species within the genera Pseudomonas, 
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Streptomyces , and Staphylococcus, although others may also 
be employed as a matter of choice. 


10 


As a representative but nonlimiting example, useful 
expression vectors for bacterial use can comprise a 
selectable marker and bacterial origin of replication 
derived from commercially available plasmids comprising 
genetic elements of the well known cloning vector pBR322 
(ATCC 37017) . Such commercial vectors include, for 
example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala 
Sweden) and GEM1 (Promega Biotec, Madison, WI, USA) . ' 
T hese pBR322 "backbone" sections are combined with an 

appropriate promoter and the structural sequence to be 
expressed. 


15 


Following transformation of a suitable host strain 
and growth of the host strain to an appropriate cell 
density, the selected promoter is induced by appropriate 
means (e.g., temperature shift or chemical induction) and 
20 cells are cultured for an additional period. 


Cells are typically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting 
crude extract retained for further purification 
25 

Microbial cells employed in expression of proteins 
can be disrupted by any convenient method, including 
freeze- thaw cycling, sonication, a french press, 

mechanical disruption, or use of cell lysing agents, such 
30 methods are well know to those skilled in the art. 
However, preferred are host cells which secrete the 
polypeptide of the invention and permit recovery of the 
p°lypeptide from tlie culture media. 


Various mammalian cell culture systems can also be 
employed to express recombinant protein. Examples of 
mammalian expression systems include the COS -7 lines of 
monkey kidney fibroblasts, described by Gluzman, cell, 
23:175 (1981), and other cell lines capable of expressing 
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a compatible vector, for example, the C127, 3T3, CHO, HeLa 

and BHK cell lines. Mammalian expression vectors will 
comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, 

5 polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking 
nontranscribed sequences . DNA sequences derived from the 
SV40 splice, and polyadenylation sites may be used to 
provide the required nontranscribed genetic elements. 

10 

The polypeptides can be recovered and/or purified 
from recombinant cell cultures by well-known protein 
recovery and purification methods. Such methodology may 
include ammonium sulfate or ethanol precipitation, acid 
^5 extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Protein 

refolding steps can be used, as necessary, in completing 
20 configuration of the mature protein. In this respect, 
chaperones may be used in such a refolding procedure . 
Finally, high performance liquid chromatography (HPLC) can 
be employed for final purification steps. 


25 


30 


35 


The polypeptides that are useful as immunogens in 
the present invention may be a naturally purified product, 
or a product of chemical synthetic procedures, or produced 
by recombinant techniques from a prokaryotic or eukaryotic 
host ( for example , by bacterial , yeast , higher plant , 
insect and mammalian cells in culture) . Depending upon 
the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be 
glycosylated or may be non-glycosylated. Particularly 

preferred immunogens are truncated pneumococcal 

polypeptides that comprise a single highly conserved alpha 
helical area, but do not comprise a choline binding region 
or a HPS region. Therefore, antibodies against such 

polypeptides should bind to other pneumococcal bacterial 
species (in addition to the S. pnevmoni&e species from 
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which such polypeptides were derived) and vaccines against 
such S. pneumoniae should give protection against other 
pneumococcal bacterial infections. 

5 Procedures for the isolation of the individually 

ex P resse ^ alpha-helical containing polypeptides may be 
isolated by recombinant expression/isolation methods that 
are well-known in the art. Typical examples for such 
isolation may utilize an antibody to a conserved area of 
Protein or to a His tag or cleavable leader or tail 
that is expressing as part of the protein structure. 

The polypeptides, their fragments or other 

derivatives, or analogs thereof, or cells expressing them 
15 can be used as an immunogen to produce antibodies thereto. 
These antibodies can be, for example, polyclonal or 
monoclonal antibodies . The present invention also 

includes chimeric, single chain, and humanized antibodies, 
as well as Fab fragments, or the product of an Fab 
20 expression library. Various procedures known in the art 

may be used for the production of such antibodies and 
fragments . 

Antibodies generated against the polypeptides 
25 corresponding to a sequence of the present invention pgn 
be obtained by direct injection of the polypeptides into 
an animal or by administering the polypeptides to an 
animal, preferably a nonhuman . The antibody so obtained 
will then bind the polypeptides itself. in this manner, 
30 even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding 
the whole native polypeptides . 

preparation of monoclonal antibodies, any 
35 technique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the 
hybridoma technique (Kohler and Milstein, 1975, Nature, 
256:495-497), the trioma technique, the human B-cell 
hybridoma technique (Kozbor et al., 1983, Immunology Today 
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4:72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole, et al . , 1985, in Monoclonal 

Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77- 
96) . 

5 

Techniques described for the production of single 
chain antibodies (U.S. Patent 4,946,778) can be adapted to 
produce single chain antibodies to immunogenic polypeptide 
products of this invention. Also, transgenic mice may be 
10 used to express humanized antibodies to immunogenic 
polypeptide products of this invention. 

In order to facilitate understanding of the above 
description and the examples which follow below, as well 
15 as the Figures included herewith. Table 1 below sets forth 
the bacterial source for the polypeptides of SEQ ID NOS: 3- 
18 and the polynucleotides encoding them (SEQ ID NOS: 20- 
35, respectively) . The name and/or type of bacteria is 
specified and a credit or source is named. The sequences 
20 from such types of bacteria are for illustrative purposes 
only since by utilizing probes and/or primers as described 
herein other sequences of similar type may be readily 
obtained by utilizing only routine skill in the art. 

25 TABLE 1 


SEQ ID NO. 

Type Of 
Pneumococcus 

Source Credit or ATCC No. 

3 

1 

ATCC 33400 

4 

2 

SPATCC 11733 

5 

2 

ATCC2 (catalog #6302) 

6 

4 

ATCC 4 (catalog #6304) 

7 

6B 1 

ATCC 6B (catalog #6326) 

8 

18C 

SPATCC 18C 

(ATCC catalog #10356) 
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TABLE 1 
(Continued) 


SEQ ID NO. 

Type Of 
Pneumococcus 

Source Credit or ATCC No. 

9 

4 

Norway type 4 ; Nat ' 1 . Inst . 
of Public Health, Norway 
Ingeborg Aagerge 

10 

noncapsulated 

R6X; Rockefeller Univ. , Rob 
Masure (from D39, type 2) 

11 

6B 

SPB 105; Boston Univ., 
Steve Pel ton 

12 

23F 

SPB 328; Boston Univ., 
Steve Pelton 

13 

14 

SPB 331; Boston Univ., 
Steve Pelton 

14 

23F 

SPB 365; Boston Univ. , 
Steve Pelton 

15 

9V 

SPR 332; Rockefeller Univ., 
Rob Masure 

16 

6B 

SPSJ 2p; St. <Jude Children's 
Research Hospital, Pat Flynn 
(clinical isolate passaged lx 
in mice for virulence) 

17 

14 

SPSJ 9; St. Jude Children's 
Research Hospital, Pat Flynn 
(clinical isolate - nares, 
pneumonia) 

18 

19A 

SPSJ 12 ; St . Jude Children ' s 
Research Hospital, Pat Flynn 
(clinical isolate) 


The present invention will be further described with 
reference to the following examples; however, it is to be 
understood that the present invention is not limited to 
10 such examples . All parts or amounts, unless otherwise 
specified, are by weight. 


UNSDOCID <WO__9951 266A2_I_> 


35 



BNSOOCID: <WO_ 


WO 99/51266 


PCT/US99/07680 


Example 1 

Generation of CbpA Truncate Protein Vectors 

A. Vector for Full Length CbpA (NR1XR2PC) 

5 A virulent serotype 4 S. pneumoniae strain, Norway 4 

(obtained from I. Aaberge, National Instute of Public 
Health, Oslo, Norway) was used as a source of genomic DNA 
template for amplifying the polynucleotide encoding full- 
length CbPA. Full length CbpA was amplified with PCR 

10 primers SJ533 and SJ537 described below. 

The degenerate forward primer SJ533 was designed 
based on the CbpA N- terminal sequence XENEG provided by 
H.R. Masure (St. Jude Childern's Research Hospital, 

15 Memphis, TN) . The SJ533 primer = 5' GGC GGA TCC ATG 
GA (A, G) AA (C, T) GA (A, G) GG 3'. It incorporates both 
BamHI and Ncol restriction sites and an ATG start codon. 

20 The 3' reverse primer SJ537 = 5' GCC GTC GAC TTA GTT 

TAC CCA TTC ACC ATT GGC 3' 

This primer incorporates a Sail restriction site for 
cloning purposes, and the natural stop codon from CbpA, 
and is based on type 4 and R6X sequence generated in- 

25 house. 

PCR product generated from genomic DNA template with 
SJ533 and SJ537 was digested with BamHI and Sail, and 
cloned into the pQE30 expression vector (Qiagen, Inc.) 

3 0 digested with BamHI, Xbal, and Smal. The type R6X 
template resulted in full-length vector PMI581 and the 
type 4 template DNA resulted in full-length vector 
PMI580 . 

35 B. Vector for CbpA Truncate Protein (NR1XR2) 

The naturally occurring PvuII site in the end on the 
second amino repeat (nucleic acid 1228 of Type 4 
sequence) was exploited to create a truncated version of 
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CbpA, containing only the amino terminus of the gene. To 
create the truncate clone, the full-length clone PMI580 
(Type 4) or PMI581 (R6X) was digested with PvuII and 
Xbal, and the amino terminus along with a portion of the 
5 expression vector was isolated by size on an agarose gel. 
pQE3 0 was digested with Xbal and Smal, and the band 
corresponding to the other half of the vector was also 
size selected on an agarose gel. The two halves were 
ligated and clones identified by restriction digest, then 
10 expressed. In this instance, the stop codon utilized is 
in the expression vector , so the protein expressed is 
larger than the predicted size due to additional amino 
acids at the 5' and 3' end of the cloning site . 

15 C. Vector for CbpA Truncate Protein (NR1X) 

A similar strategy was used to express only the 
first amino repeat of CbpA. Here the naturally occurring 
Xu™! site between the two amino repeats (nucleic acid 856 
of Type 4 sequence) was utilized. CbpA full-length clone 
20 PMI580 was digested with Xmnl and Aatll. Expression 
vector pQE30 was digested with Aatll and Smal. Once 
again, the two sized fragments were ligated, and clones 
were screened by restriction digest and expressed. 

2 5 Example 2 

Expression of CbpA Truncate Protein From Expression 
Vectors 


All proteins are expressed from the vectors 

30 described in Examples 1A-1C in the Qia expressionist 
System (Qiagen) using the E. coli expression vector 
PQE30, and the amino terminus His6 tagged proteins are 
detected by western analysis using both anti -Histidine 
antibodies and gene specific antibodies. 

35 

The expressed CBP truncates were purified as 

follows . 

A single colony was selected from plated bacteria 
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containing the recombinant plasmid and grown overnight at 
37° in 6.0 ml LB buffer with 50 ug/ml Kanamycin and 100 
ug/ml Ampicillin. This 6.0 ml culture was added to 1L LB 
with antibiotics at above concentrations. The culture 
5 was shaken at 37°C until A, 00 = -0.400. 1M IPTG was added 
to the 1L culture to give a final concentration of ImM. 
The culture was then shaken at 37°C for 3-4 h. The 1L 
culture was spun 15 min. in 250 ml conical tubes at 4000 
rpm in a model J-6B centrifuge. The supernatant was 

xo discarded and the pellet stored at -20°C until use. 

The 1L pellet was resuspended in 25 ml 50 mM 
NaH 2 P0 4 , lOmM Tris, 6M GuCl, 300mM NaCI, pH 8.0 (Buffer 
A) . This mixture was then rotated at room temperature 
X 5 for 30 minutes. The mixture was then subjected to 

sonication (VibraCell Sonicator, Sonics and Materials, 
Inc., Danbury, CT) using the microtip, two times, for 3 0 
sec., at 50% Duty cycle and with the output setting at 7. 
The mixture was spun 5 min. at 10K in a JA2 0 rotor and 
20 the supernatant removed and discarded. The supernatant 

was loaded on a 10 ml Talon (Clonetech, Palo Alto, CA) 
resin column attached to a GradiFrac System (Pharmacia 
Biotech, Upsala, Sweden) . The column was equilibrated 
with 100 ml Buffer A and washed with 200 ml of this 
25 buffer. A volume based pH gradient using 100% 50 mM 

NaH 2 P0 4 , 8M Urea, 20mM MES, pH 6.0 (Buffer B) as the 
final target buffer was run over a total volume of 100 
ml. Protein eluted at - 3 0% Buffer B. Eluted peaks were 
collected and pooled. 

30 

For refolding, dialysis was carried out with a 2L 
volume of PBS at room temperature for approximately 3 hr. 
using dialysis tubing with a molecular weight cutoff of 
14,000. The sample was then dialyzed overnight in 2L of 

35 PBS at 4°C . Additional buffer exchange was accomplished 
during the concentration of the protein using Centriprep- 
30 spin columns by adding PBS to the spun retentate and 
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respinning . The protein concentration was determined 
using the BCA protein assay and the purity visualized 
using a Coomassie stained 4-20% SDS-PAGE gel. 


5 


Example 3 

Pass ive Protection with Anti-CbpA Truncate NR1XR2 
Antisera 


10 A. Generation of Rabbit Immune Serum 

Rabbit immune serum against CbpA truncate was 
generated at Covance (Denver, PA) . Following collection 
of preimmune serum, a New Zealand white rabbit (#ME101) 
was immunized with 250 jag CbpA truncate NR1XR2 

15 (containing both alpha helix I and alpha helix II amino 
acid N-terminal repeats that are prepared from 483:58) in 
Complete Freund's Adjuvant. The rabbit was given a boost 

of 125 jxg CbpA truncate in Incomplete Freund's Adjuvant 
on day 21 and bled on days 31 and 52. 

20 

B. Passive Protection in Mice 

C3H/HeJ mice (5 mice/group) were passively immunized 
intraperitoneally with 100 jxl of a 1:2 dilution of rabbit 
sera sterile PBS (preimmune or day 31 immune sera) . 

25 One hour after administration of serum, mice were 
challenged with 1600 cfu virulent serotype 6B s. 
pneumoniae, strain SP317 (obtained from H.R. Masure) . 
Mice were monitored for 14 days for survival . 

30 Eighty percent of the mice immunized with rabbit 

immune serum raised against CbpA truncate NR1XR2 protein 
survived the challenge for 14 days (Figure 2) . All mice 
immunized with preimmune rabbit serum were dead by day 7. 

35 

C. Passive Protection in Mice (Higher Challenge 

Dose) 
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C3H/HeJ mice (10 mice/group) were passively 
immunized intraperitoneally with 100 fil of a 1:2 dilution 
of rabbit sera in sterile PBS (preimmune or day 52 immune 
sera) . One hour after administration of serum, mice were 
5 challenged with 3450 cfu virulent serotype 6B S. 
pneumoniae, strain SP317. Mice were monitored for 10 

days for survival. 

One hundred percent of the mice immunized with 
10 rabbit immune serum raised against CbpA truncate NR1XR2 
protein survived the challenge for ten days (Figure 3) . 
Ninety percent of the mice immunized with preimmune 
rabbit serum were dead at day 10. 


15 d . Passive Protection in Mice (Against High 

Virulence) 

C3H/HeJ mice (10 mice/group) were passively 
immunized intraperitoneally with 100 ^xl of a 1:2 dilution 
of rabbit sera in sterile PBS (preimmune or day 52 immune 
20 sera) . One hour after administration of serum, mice were 
challenged with 580 cfu virulent serotype 6B S. 
pneumoniae, strain SPSJ2 (provided by P. Flynn, St. Jude 
Children's Research Hospital, Memphis, TN) . Mice were 
monitored for 10 days for survival. 

25 

Fifty percent of the mice immunized with rabbit 
immune serum raised against CbpA truncate NR1XR2 protein 
survived the challenge for 10 days (Figure 4) . All of 
the mice immunized with preimmune rabbit serum were dead 
30 at day 8 . 


These data demonstrate that antibodies specific for 
CbpA are protective against systemic pneumococcal 
infection. The data further indicate that the choline- 
35 binding region is not necessary for protection, as 
antibody specific for truncated protein NR1XR2, lacking 
the choline-binding repeats, was sufficient for 
protection. In addition, serum directed against 
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recombinant CbpA protein based on a serotype 4 sequence, 
was still protective against challenge with two different 
strains of serotype 6B. 


5 


Active 

NR1XR2 


Example 4 

P rotection — with Anti -CbpA Truncates NR1X and 


10 


15 


20 


25 


A. Active Protection With NR1X Truncate 
C3H/HeJ mice (10/group) were 

intraperitoneally with CbpA truncate protein 


Vaccination 
immunized 
NR1X (15pg 


in 50 pi PBS, 
group of 10 
adjuvant. A 


P^- us 50 pi Complete Freund's Adjuvant) . a 
sham immunized mice received PBS and 
second immunization was administered four 


weeks later, 15 pg protein i.p. with Incomplete Freund's 
Adjuvant (sham group received PBS plus IFA) . Blood was 
drawn (retro-orbital bleed) at weeks 3, 6, and 9 for 

analysis of immune response. The ELISA end point anti- 
CbpA truncate titer of pooled sera from the 10 CbpA 
immunized mice at 9 weeks was 4,096,000. No antibody was 
detected in sera from sham immunized mice. Mice were 
challenged at week 10 with 560 CFU serotype 6 B S. 

pneumoniae strain SPS J2 . Mice were monitored for 14 days 
for survival. 


Eighty percent of the mice immunized with CbpA 
truncate protein NR1X survived the challenge for 14 Days 

(results shown in Figure 5) . All sham immunized mice 
were dead by day 8 . 


35 


B, 


Active 


Protection 


with 




rruncace 


Vaccination 

C3H/HeJ mice (10/group) were immunized 
intraperitoneally with CbpA truncate protein NR1XR2 (15pg 
in 50 pi PBS, plus 50 pi Complete Freund's Adjuvant) . a 
group of 10 control immunized mice received pneumococcal 
recombinant protein SP90 and adjuvant. a second 
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immunization was administered four weeks later, 15^g 
protein i.p. with Incomplete Freund's Adjuvant. Blood 
was drawn (retro- orbital bleed) at weeks 3, 6, and 9 for 

analysis of immune response. The ELISA end point anti- 
5 cbpA truncate titer of pooled sera from the 10 CbpA 
immunized mice at 9 weeks was 4,096,000. Mice were 
challenged at week 10 with 680 CFU serotype 6B S. 
pneumoniae strain SPSJ2 . Mice were monitored for 14 days 
for survival . 


10 


15 


20 


25 


Fifty percent of the mice immunized with CbpA 
truncate protein NR1XR2 survived the challenge for 14 
days (results shown in Figure 6) . All control immunized 
mice were dead by day 9 . 


These data demonstrate that immunization with 
recombinant CbpA truncate proteins elicit production of 
specific antibodies capable of protecting against 
systemic pneumococcal infection and death. The data 
further indicate that the choline -binding region is not 
necessary for protection, as the immunogens were 
truncated proteins NR1X and NR1XR2 . Additionally, the 
results suggest that a single amino terminal repeat may 


be sufficient to elicit a protective response. Cross 
protection is demonstrated as the recombinant 
pneumococcal protein was generated based on serotype 4 
DNA sequence and protection was observed following 
challenge with a serotype 6B isolate. 


30 

Numerous modifications and variations of the present 
invention are possible in light of the above teachings 
and, therefore, within the scope of the appended claims, 
the invention may be practiced otherwise than as 
35 particularly described. 
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WHAT IS CLAIMED IS: 


1. A vaccine against bacterial infections 
comprising an immunogen which is a polypeptide truncate 
5 of a pneumococcal surface-binding protein, analog or 
variant having a highly conserved immunogenic alpha- 
helical portion with respect to different types of 

pneumococcal bacteria, which polypeptide does not include 
a choline -binding portion. 

10 

2. A vaccine according to claim l, wherein 
the amino acid sequence of said alpha-helical portion has 
at least 75 % identity with respect to the amino acid 
sequence of SEQ ID NO:l. 

15 

3. A vaccine according to claim 1, wherein 
the amino acid sequence of said alpha-helical portion has 
at least 85 % identity with respect to the amino acid 
sequence of SEQ ID NO:i. 

20 

4. A vaccine according to claim 1, wherein 
the amino acid sequence of said alpha-helical portion has 
at least 90 % identity with respect to the amino acid 
sequence of a member consisting of: 

the ^i 110 acid sequence of SEQ ID NO:l, and 
(b) the amino acid sequence of SEQ ID NO: 19 


5. A vaccine according to claim 1, wherein 
the amino acid sequence of said alpha-helical portion has 
at least 95 % identity with respect to the amino acid 
sequence of a member selected from the group consisting 

(a) the ammo acid sequence of SEQ id NO:l, and 

(b) the amino acid sequence of SEQ ID NO: 19 . 

35 

6. A vaccine according to claim 1, wherein 
said vaccine is for preventing or treating otitis media, 
sepsis, meningitis and lobar pneumonia infections. 
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7 . 

said vaccine 


A vaccine according to claim 6, wherein 
is for invasive infections. 


8 . A vaccine according to claim 6 , wherein 

5 said vaccine is for otitis media infections caused by S. 
pneumoniae. 

9 . A vaccine according to claim 1 , wherein 
said polypeptide truncate comprise an amino acid sequence 

10 which has at least 90 % identity with respect to a member 
selected from the group consisting of the ammo acid 
sequences of each of SEQ ID NOS: 3 to 18. 

10. A vaccine according to claim 1, wherein 

15 said polypeptide truncate comprise an amino acid sequence 
which has at least 95 % identity with respect to a member 
selected from the group consisting of the amino acid 
sequences of each of SEQ ID NOS: 3 to 18. 

20 11. An antibody raised against an immunogen 

which is a polypeptide truncate of a pneumococcal 
surface -binding protein, analog or variant having a 
highly conserved immunogenic alpha- helical portion with 
respect to different types of pneumococcal bacteria, 

25 which polypeptide does not include a choline -binding 
portion. 


12. An antibody according to claim 11, wherein 
the amino acid sequence of said alpha-helical portion has 

3 0 at least 85 % identity with respect to the amino acid 
sequence of SEQ ID NO : 1 . 

13 . An antibody according to claim 11 , wherein 
the amino acid sequence of said alpha- helical portion has 

35 at least 90 % identity with respect to the amino acid 
sequence of a member 

selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:l, 

and 
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(b) the amino acid sequence of SEQ ID NO: 19. 


14. An antibody according to claim 11, wherein 
the amino acid sequence of said alpha-helical portion has 
5 at least 95 % identity with respect to the amino acid 

sequence of a member selected from the group consisting 
Of : 

(a) the amino acid sequence of SEQ ID NO:l, 

and 

10 the amino acid sequence of SEQ ID NO: 19. 

15. An antibody according to claim 11, wherein 
said polypeptide truncate comprise an amino acid sequence 
which has at least 95 % identity with respect to a member 

15 selected from the group consisting of the amino acid 
sequences of each of SEQ ID NOS: 3 to 18. 

16. An antibody according to claim 11, wherein 
sa ^- < ^ antibody is an antibody that will detect S. 

20 pneumoniae infections. 

17. An antibody according to claim 15, wherein 
said antibody is effective for the prevention and/or 
treatment of s. pneumoniae infections. 

25 

18. An antibody according to claim 15, wherein 

said antibody is effective for the prevention and/or 
treatment of pneumococcal infections caused by types 1-5, 
6A, 6B, 7F, 8, 9N, 9V , 10A, 11A, 12F, 14, 15B, 17F, 18C, 

30 19F, 19A, 20, 22F, 23F and 33F S. pneumoniae bacteria. 


19. A method for preventing and/or treating 
pneumococcal infections in a host comprising immunizing 
said host with a member selected from the group 
35 consisting of: 

(a) a vaccine according to claim 2, and 

3-t least one antibody raised against an 
immunogen which is a polypeptide truncate of a 
pneumococcal surface -binding protein, analog or variant 
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comprising an amino acid sequence that is has at least 90 
% identity to the amino acid sequence of a member 
selected from the group consisting of SEQ ID NO: 3 to 18, 
which polypeptide does not include a choline -binding 
5 portion. 

20. A polypeptide comprising an amino acid 
sequence which has at least 90 % identity with respect to 
a member selected from the group consisting of the amino 

10 acid sequences of each of SEQ ID NOS: 3 to 18. 

2 1. An isolated polynucleotide comprising 
polynucleotide sequence having at least 90 % identity to 
a member selected from the group consisting , of : 

15 (a) a polynucleotide coding sequence encoding 

a polypeptide comprising a member selected from the group 
consisting of the amino acid sequences of each of SEQ ID 

NOS: 3 to 18, and 

(b) and the complement of (a) . 
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BNSDOC1D: <WO_ 


SEQUENCE LISTING 


<110> Wizemann, Theresa M. 

Koenig, Scott 
Johnson, Leslie S 

<12 0> Derivatives of Choline Binding Proteins for Vaccines 

<130> 469201-364 

<140> 

<14 1 > 


<150> US 60/085,743 
<151> 1998-05-15 

<160> 38 


<170> MS-Word (DOS Text) 

<210> 1 
<211> 103 
<212 > PRT 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Amino acid sequence derived from a cDNA i-u 

Streptococcus pneumoniae fr ° m the genome of 

<4 00> 1 

Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys Lys Lys 
5 10 15 

Ala Glu Asp Gin Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr 
20 25 30 

Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys 
35 4 0 45 

Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser Arg Asn 

55 60 

Glu Glu Lys He Lys Gin Ala Lys Ala Lys Val Glu Ser Lys Lys Ala 

70 7 5 80 

Glu Ala Thu Arg Lau Glu Lys II. Lys Thr Aap Arg Lys Ly. Ala Glu 
85 90 95 

Glu Glu Ala Lys Arg Lys Ala 
100 


<210> 2 
<211> 141 
<212> PRT 

<2X3> Artificial Sequence 


< 220 > 
<223 > 


Description: Amino acid sequence derived from 
streptococcus pneumoniae 


a 


cDNA from the genome of 


<400> 2 


1 


_995 1 266A2_ I > 
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Glu Ala Lys Arg Lys Ala Glu Glu Ser Glu Lys Lys Ala Ala Glu Ala 
1 5 10 

Lys Gin Lys Val Asp Ala Glu Glu Tyr Ala Leu Glu Ala Lys He Ala 
20 25 3° 

Glu Leu Glu Tyr Glu Val Gin Arg Leu Glu Lys Glu Leu Lys Glu lie 


Asp Glu Ser Asp Ser Glu Asp Tyr 
.n 55 


Leu Lys Glu Gly Leu Arg Ala Pro 
60 


Leu Gin Ser Lys 
65 


Leu Asp Thr Lys Lys Ala Lys Leu Ser Lys Leu Glu 


Glu Leu Ser Asp Lys lie Asp 
85 


Glu Leu Asp Ala Glu lie Ala Lys Leu 

on 95 


Glu Val Gin Leu Lys Asp Ala Glu Gly Asn Asn Asn Val Glu Ala Tyr 
100 105 

Phe Lys Glu Gly Leu Glu Lys Thr Thr Ala Glu Lys Lys Ala Glu Leu 
115 . 120 - 125 

Glu Lys Ala Glu Ala Asp Leu Lys Lys Ala Val Asp Glu 
130 135 140 


<210> 3 
<211> 431 
<212> PRT 

<213 > Artificial Sequence 


<223 > Description: Amino acid sequence 
Streptococcus pneumoniae 


derived from a cDNA from the genome of 


Tto°Glu Lys Glu Val Thr Thr Pro Val Ala Thr Ser Ser Asn Lys Ala 


Asn Lys Ser 


Gin Thr Glu His Met Lys Ala Ala Glu Gin Val Asp Glu 


Tyr He Asn Lys Met He Gin Leu Asp Lys Arg Lys His Thr Gin Asn 
35 40 45 

Leu Ala Leu Asn He Lys Leu Ser Ala He Lys Thr Lys Tyr Leu Arg 


Glu Leu Asn 
65 


Val Leu Glu Glu Lys Ser Lys Lys Glu Glu Leu Thr Ser 
in 75 80 


Lys Thr Lys Lys Glu lie Asp Ala Ala Phe Glu Gin Phe Asn Lys Asp 
85 90 

Thr Leu Lys Pro Gly Glu Lys Val Glu Glu Ala Glu Lys Lys Val Glu 
100 105 

Glu Ala Glu Lys Lys Ala Lys Asp Gin Lys Glu Glu Asp His Arg Asn 
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Tyr Pro Thr lie Thr Tyr Lys Thr Leu Glu 


13 0 


135 


Leu Glu lie Ala Glu Ser 


140 


Ts V * 1 S1 “ V * 1 ^ To Ala ° 1U ° 1U ^ Vsl W. Mu Glu Ala 


155 


160 


Lys Gly Ser Arg Asn Glu Glu Lys lie Lys Lys Ala 


165 


170 


Lys Ala Glu Val 
175 


Glu Ser Lys Lys Ala Glu Ala Thr Lys Leu Glu 


180 


185 


Glu lie Lys Thr Glu 


190 


Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala 


200 


Glu Ala Glu Glu 
205 

Mu val Ays Asn Ays lau Ays Ays Arg Thr Ays Arg G ly Ala Aha Gl y 

215 220 

Glu Pro Ala Thr Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp 

235 240 

Ser ser Val Val Lys Lys Ser Ser Lys Pro He Leu Lys Ser Glu Lys 

25° 255 

!"■ Val “* "* Glu ^ «* Mu Ala Glu Ays A y s Val 

265 270 

Aia Glu Ala Glu Lys Lys Ala Lys Asp Gin Lys Glu Glu Asp Arg Arg 

280 285 

Asn Tyr Pro Thr Asn Thr Tyr Ays Thr Ann Glu Aeu Glu II. Ala Glu 

295 300 

S.r Asp val Ays Val Ays Glu Ala Glu Aau Glu A.u Val Ays Glu Glu 

320 

Ala Lys Glu Pro Gin Asn Glu Glu Lys lie Lys Gin Ala Lys Ala Lys 

330 335 

val Glu Gar Ay, Ays Ala Glu Ala Thr Arg Aau Glu Ays II. Ays Thr 

345 , 350 

ASP Arg Ays Ays Ala Glu Glu Ala Ays Arg Ay. Val Ala Glu Glu Asp 

360 365 

Ays val Ay. Glu Ays Pro Ala Glu Gin Pro -Gin Pro Ala Pro Ala Pro 

380 

Ays Pro Ala Pro Ala Pro Gin Pro Glu Ays Pro Ala Glu Gin Pro Ay. 

395 400 

Ala Glu Ays Pro Ala Asp Gin Gin Ala Glu Gl. Asp Tyr Ala Arg Arg 

410 415 

Ser Glu Glu Glu Tyr Asn Pro Leu Asp Leu Thr Ala Pro Ala Lys 


425 


430 


< 210 > 4 
<211> 251 
<212> PRT 

<213> Artificial Sequence 
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< 220 > 

<223 > Description: Amino 

Streptococcus pneumoniae 


acid sequence derived from a cDNA from the genome of 


<4 ° 0> 4 Qlu Gly Ser Thr Gin Ala Ala Thr Ser Ser Asn Met Ala 


Thr Glu Asn 
1 


10 


Lys Thr Glu His Arg Lys Ala Ala Lys Gin Val Val Asp Glu Tyr He 
20 25 30 

Glu Lys Met Leu Arg Glu lie Gin Leu Asp Arg Arg Lys His Thr Gin 
35 40 

Asn Val Ala Leu Asn lie Lys Leu Ser Ala lie Lys Thr Lys Tyr Leu 
50 55 60 

Arg Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser 


65 


Glu lie Lys 


70 


Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp 


85 


90 


Thr Leu 


Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu 
100 1°5 HO 

Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Glu Asp Arg Arg Asn 
115 12° 125 

Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Phe 
1 i *3 c 140 

130 

Asp Val Lys val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala 


145 


Lys 


150 


Glu Ser Arg Asn Glu Gly Thr lie Lys Gin Ala Lys Glu Lys Val 


165 


170 


Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn lie Lys Thr Asp 
180 1« 5 190 

Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp 


Lys 


195 200 

Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala Pro Ala Thr- 


210 


215 


Gin Pro Glu Lys Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gin Pro 


225 


230 


235 


Lys Ala Glu Lys Thr Asp Asp Gin Gin Ala Glu 


245 


<210> 5 
<211> 413 
<212> PRT 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Amino 


acid sequence derived from a cDNA from the genome 
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Streptococcus pneumoniae 


< 400 > 5 

Thr Glu Lys Glu 
1 

hys Thr Glu His 
20 

Glu Lys Met Leu 
35 

Asn Phe Ala Phe 
50 

Tyr Gly Leu Lys 
65 

Ala Lys Leu Asp 


Pro Gly Glu Lys 
100 

Lys Lys Ala Lys 
115 

Asn Thr Tyr Lys 
13 0 

Val Lys Lys Ala 
145 

Asn Glu Asp Thr 


Ala Glu Ala Thr 
180 

Glu Glu Glu Ala 
195 

Asp Lys Leu Lys 
210 

Thr Phe Phe Lys 
225 

Gly Glu Glu Thr 


Ala Glu Ala Glu 
260 

Gin Lys Glu Glu 
275 

Leu Asp Leu Glu 
290 

Leu Glu Leu Val 
305 


Val Thr Thr Gin 
5 

Arg Lys Ala Ala 


Arg Glu lie Gin 
40 

Asn Met Lys Leu 
55 

Glu Lys Ser Glu 
70 

Ala Ala Phe Glu 
85 

Val Ala Glu Ala 


Ala Gin Lys Glu 
120 

Thr Leu Glu Leu 
135 

Glu Leu Glu Leu 
150 

lie Asn Gin Ala 
165 

Leu Lys Glu Glu 


Lys Arg Lys Ala 
200 

Arg Arg Thr Lys 
215 

Lys Glu Asn Asp 
230 

Leu Pro Ser Pro 
245 

Lys Lys Val Ala 


Asp Arg Arg Asn 
280 

lie Ala Glu Ser 
295 

Lys Glu Glu Ala 
310 


Val Pro Thr Tyr 
10 

Lys Gin Val Val 
25 

Leu Asp Arg Arg 


Ser Ala lie Lys 
60 

Ala Glu Leu Pro 
75 

Gin Phe Lys Lys 
90 

Lys Lys Lys Val 
105 

Glu Asp Arg Arg 


Glu lie Ala Glu 
140 

Leu Lys Glu Glu 
155 

Lys Ala Lys Val 
170 

lie Lys Thr Asp 
185 

Glu Ala Glu Glu 


Arg Ala Val Pro 
220 

Ala Lys Ser Ser 
235 

Ser Leu Lys Ser 
250 

Glu Ala Glu Lys 
265 

Tyr Pro Thr Asn 


Asp Val Lys Val 
300 

Lys Gly Ser Arg 
315 


Ser Asn Met Ala 
15 

Asp Glu Thr lie 
30 

Lys His Thr Gin 
45 

Thr Glu Tyr Leu 


Ser Glu Val Lys 
80 

Asp Thr Leu Lys 
95 

Ala Glu Ala Glu 
110 

Asn Tyr Pro Thr 
125 

Ser Asp Val Glu 


Ala Lys Thr Arg 
160 

Glu Ser Lys Lys 
175 

Arg Lys Lys Ala 
190 

Asp Lys Val Lys 
205 

Gly Glu Pro Ala 


Asp Ser Ser Val 
240 

Gly Lys Lys Val 
255 

Lys Ala Lys Asp 
270 

Thr Thr Lys Thr 
285 

Lys Glu Ala Glu 


Asn Glu Glu Lys 
320 


_9951266A2J_> 
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lie Asn Gin Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala Thr 
325 330 335 

Arg Leu Glu Lys Thr Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala 
340 345 350 

Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu 
355 360 365 

Gin Pro Gin Pro Ala Pro Ala Pro Gin Pro Glu Lys Pro Thr Glu Glu 
370 375 380 

Pro Glu Asn Pro Ala Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gin 
385 390 395 400 

Pro Lys Ala Glu Lys Thr Asp Asp Gin Gin Ala Glu Glu 
405 410 


< 210 > 6 
<211> 446 
<212> PRT 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Amino acid sequence derived from a cDNA from the genome of 
Streptococcus pneumoniae 

<400> 6 

Thr Glu Asn Glu Gly Ala Thr Gin Val Pro Thr Ser Ser Asn Arg Ala 
15 10 15 

Asn Glu Ser Gin Ala Glu Gin Gly Glu Gin Pro Lys Lys Leu Asp Ser 
20 25 30 

Glu Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys lie 
35 40 45 

Val Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr lie Thr 
50 55 60 

Val Ala Leu Val Asn Glu Leu Asn Asn He Lys Asn Glu Tyr Leu Asn 
65 70 75 80 

Lys lie Val Glu Ser Thr Ser Glu Ser Gin Leu Gin lie Leu Met Met 
85 90 95 

Glu Ser Arg Ser Lys Val Asp Glu Ala Val -Ser Lys Phe Glu Lys Asp 
100 105 110 

Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser 
115 120 125 

Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala 
130 135 140 

Glu Ala Lys Lys Lys Val Glu Glu Val Glu Lys Lys Ala Lys Asp Gin 
145 150 155 160 

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr lie Thr Tyr Lys Thr Leu 
165 170 175 
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Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu 
180 185 a90 

Glu Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Lys Gin Lys lie 
195 200 205 

Lys Gin Ala Glu Ala Glu Val Glu Ser Lys Gin Ala Glu Ala Thr Arg 
210 a 


215 


220 


Leu Lys Lys lie Lys Thr Asp Arg Glu Glu Ala Glu Glu Glu Ala Lvs 

225 230 235 240 

^9 Arg Ala Asp Ala Lys Glu Gin Gly Lys Pro Lys Gly Arg Pro Lys 


245 


250 


255 


Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp 
260 265 270 

Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro 
275 280 285 

Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu 
290 295 300 

Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Glu Asp Arg Arg Asn 

3 05 310 oic 

315 320 

Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Ser 
325 330 335 

Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala 
340 345 350 

Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gin Ala Lys Ala Glu Val 


355 


360 


365 


G1U 370 ^ LyS Ala G1U *1* Thr Ar9 LeU Glu LyS Ile Lys Thr Asp 


375 


380 


Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp 
385 ^ 


390 


395 


400 


Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala Pro Ala Pro 
405 410 415 

Lys Thr Glu Lys Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu 
42° 425 430 

Gin Pro Lys Ala Glu Lys Pro Ala Asp Gin Gin Ala Glu Glu 
435 440 445 


<210> 7 
<211> 428 
<212 > PRT 

<213 > Artificial Sequence 


< 220 > 
<223 > 


Description: Amino acid 
Streptococcus pneumoniae 


sequence derived from a cDNA from the genome of 
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< 400 > 7 

Glu Gly Val Arg Ser Gly Asn Asn Ser Thr Val Thr Ser Ser Gly Gin 

1 5 10 15 

Asp lie Ser Lys Lys Tyr Ala Asp Glu Val Glu Ser His Leu Gin Ser 

20 25 30 

lie Leu Lys Asp Val Asn Lys Asn Leu Lys Lys Val Gin His Thr Gin 
35 40 45 

Asn Ala Asp Phe Asn Lys Lys Leu Ser Lys lie Lys Thr Lys Tyr Leu 
50 55 60 

Tyr Glu Leu Asn Val Leu Glu Glu Lys Ser Glu Ala Glu Leu Thr Ser 
65 70 75 80 

Lys Thr Lys Glu Thr Lys Glu Glu Leu Thr Ala Ala Phe Glu Gin Phe 
85 90 95 

Lys Lys Asp Thr Leu Ser Thr Glu Pro Glu Lys Lys Val Ala Glu Ala 
100 105 110 

Lys Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu 
115 120 125 

Lys Asp Arg Arg Asn Tyr Pro Thr He Thr Tyr Lys Thr Leu Glu Leu 
130 135 140 

Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu 
145 150 155 160 

Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Glu Lys lie Lys Gin 
165 170 175 


Ala Glu Ala Lys Val Glu Ser Lys Gin Ala Glu Ala Thr Arg Leu Lys 
180 185 190 

Lys He Lys Thr Asp Arg Glu Gin Ala Glu Ala Thr Arg Leu Glu Asn 
195 200 205 

Xle Lys Thr Asp Arg Glu Gin Ala Glu Glu Glu Ala Lys Val Lys Asp 
210 215 220 

Glu Pro Lys Lys Arg Thr Lys Arg Gly Val Leu Gly Glu Pro Ala Thr 
225 230 235 240 

Pro Asp Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly 
245 250 255 

Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala 
260 265 270 

Glu Ala Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin 
275 280 285 

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu 
290 295 300 

Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu 
305 310 315 320 

Glu Leu Val Lys Glu Glu Ha Lys Glu Pro Arg Asn Glu Glu Lys Val 
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325 





330 





335 


Lys 

Gin 

Ala 

Lys 

340 

Ala 

Glu 

Val 

Glu 

Ser 

345 

Lys 

Gin 

Ala 

Glu 

Ala 

350 

Thr 

Arg 

Leu 

Glu 

Asn 

355 

He 

Lys 

Thr 

Asp 

Arg 

360 

Lys 

Lys 

Ala 

Glu 

Glu 

365 

Glu 

Ala 

Lys 

Arg 

Lys 

370 

Ala 

Ala 

Glu 

Glu 

Asp 

375 

Lys 

Val 

Lys 

Glu 

Lys 

380 

Pro 

Ala 

Glu 

Gin 

Pro 

385 

Gin 

Pro 

Ala 

Pro 

Ala 

390 

Pro 

Gin 

Pro 

Glu 

Lys 

395 

Pro 

Ala 

Pro 

Lys 

Asp 

400 

Glu 

Lys 

Pro 

Ala 

Pro 

405 

Ala 

Pro 

Lys 

Pro 

Glu 

410 

Asn 

Pro 

Ala 

Glu 

Gin 

415 

Pro 

Lys 

Ala 

Glu 

Lys 

Pro 

Ala 

Asp 

Gin 

Gin 

Ala 

Glu 

Glu 






420 425 


<210> 8 
<211> 219 
<212> PRT 

<213> Artificial Sequence 
< 220 > 

<223> description: Amino acid sequence derived from a cDNA from the genome of 
Streptococcus pneumoniae y U,L 

<400> 8 

Glu Gly Val Arg Ser Gly Asn Asn Ser Thr Val Thr Ser Ser Gly Gin 
15 10 15 


Asp lie Ser Lys Lys Tyr Ala Asp Glu Val Glu Ser His Leu Gin 


20 


25 


Ser 


30 


He Leu Lys Asp Val Asn Lys Asn Leu Lys Lys Val Gin His Thr Gin 
35 40 45 

Asn Ala Asp Phe Asn Lys Lys Leu Ser Lys lie Lys Pro Lys Tyr Leu 

55 60 

Tyr Glu Leu Lys Cys Leu Glu Glu Lys Ser Glu Ala Glu Leu Thr Ser 

65 70 75 80 

Lys Pro Lys Asn Lys Arg Arg Val Thr Ala Ala Phe Glu Gin Phe Lys 

85 90 95 

Lys Asp Thr Leu Ser Thr Glu Pro Glu Lys Lys Val Ala Glu Ala Lys 
100 105 110 

Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Lys 

115 i m 1 


120 


125 


Asp Arg Arg Asn Tyr Pro Thr lie Thr Tyr Lys Thr Leu Glu 


130 


135 


Leu Glu 


140 


lie Ala Glu Ser Asp Val Glu Val 
145 


Lys Lys Ala Glu Leu Glu Leu Val 
150 «5 160 

Lys Glu Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gin Ala 


9951 266 A2_L> 
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165 





170 





175 


Lys 

Ala 

Glu 

Val 

180 

Glu 

Ser 

Lys 

Gin 

Ala 

185 

Glu 

Ala 

Thr 

Arg 

Leu 

190 

Glu 

Lys 

He 

Lys 

Thr 

195 

Asp 

Arg 

Lys 

Lys 

Ala 

200 

Glu 

Glu 

Glu 

Ala 

Lys 

205 

Arg 

Lys 

Ala 

Ala 

Glu 

210 

Glu 

Asp 

Lys 

Val 

Lys 

215 

Glu 

Lys 

Pro 

Ala 







<2X0> 9 
<211> 446 
<212> PRT 

<213 > Artificial Sequence 


< 220 > 


<223 > Description: Amino acid sequence derived from a cDNA from the genome of 
Streptococcus pneumoniae 


<400> 9 _ _ 

Thr Glu Asn Glu Gly Ala Thr Gin Val Pro Thr Ser Ser Asn Arg Ala 
15 10 15 

Asn Glu Ser Gin Ala Glu Gin Gly Glu Gin Pro Lys Lys Leu Asp Ser 
20 25 30 

Glu Arg Asp Lys Ala Arg Lys Glu Val Glu Glu Tyr Val Lys Lys lie 
35 40 45 

Val Gly Glu Ser Tyr Ala Lys Ser Thr Lys Lys Arg His Thr lie Thr 
50 55 60 

Val Ala Leu Val Asn Glu Leu Asn Asn lie Lys Asn Glu Tyr Leu Asn 
65 70 75 80 

Lvs lie Val Glu Ser Thr Ser Glu Ser Gin Leu Gin lie Leu Met Met 
85 90 95 

Glu Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp 
100 105 HO 

Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser 
115 120 125 

Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pro Gly Glu Lys Val Ala 
130 135 140 

Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gin 
145 150 155 160 

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr lie Thr Tyr Lys Thr Leu 
165 170 175 

Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu 

185 190 


180 


Glu Leu Val Lys Val Lys 

3.95 200 


Ala Asn Glu Pro Arg Asp Glu Gin Lys lie 

205 


Lys Gin Ala Glu Ala Glu Val Glu Ser Lys Gin Ala Glu Ala Thr Arg 

10 
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210 


215 


220 


S Ays Ays II. Lye Thr Asp Arg Glu Glu Ale Glu Glu Glu Al. Ays 


235 


Arg Arg Ala Asp Ala Lys Glu Gin Gly Lys Pro Lys sly 


240 


250 


Arg Ala Lys 
255 


Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp 

265 


270 


Aia Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr 


280 


Leu Pro Ser Pro 
285 


*“ Aju Ays Pro Glu Ays Ays v.l Al. Glu Al. Glu Ays Ays v.l Glu 


300 


Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu 


310 


315 


Glu Asp Arg Arg Asn 
320 


Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu He Ala Glu Ser 
325 330 335 

Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala 

345 350 

Ays Glu Pro Arg As„ Glu Glu Ays v.l Ays Glu Al. Ays Al. Glu v.l 

360 365 

Glu ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys He Lys Thr Asp 

373 380 

£9 Ays Ays Al. Glu Glu Glu Al. Ays Arg Ays Al. Al. Glu Glu Asp 

395 400 

lys V.1 Ays Glu Ays Pro Al. Glu Glu Pro Glu Pro Al. Pro Al. Pro 

410 415 

Ays Al. Glu Ays Pro Al. Pro Al. Pro Ays Pro Glu Asu Pro Al. Glu 


Gin Pro Lys Ala Glu Lys Pro Ala 
435 


440 


425 430 

Asp Gin Gin Ala Glu Glu 
445 


<210> 10 
<211> 414 
<212> PRT 

<213> Artificial Sequence 


< 220 > 
<223 > 


St™ Pti ° n: * mino acid sec ? uen ce derived from 
Streptococcus pneumoniae 


a cDNA from the genome of 

Thr Glu Asn Glu Gly Ser Thr Gin Ala Ala Thr Ser Ser Asn Met Ala 
5 10 15 

Lys Thr Glu His Arg Lys Ala Ala Lys Gin Val Val Asp Glu Tyr He 

25 30 

Glu Ays Met Aeu Arg Glu 11. Glu Asu Asp Arg Arg Ays His Thr Glu 

11 
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35 

Asn Val Ala Leu 
50 

Arg Glu Leu Asn 
65 

Glu lie Lys Ala 


Thr Leu Lys Pro 
100 

Glu Ala Lys Lys 
115 

Tyr Pro Thr Asn 
130 

Asp Val Lys Val 
145 

Lys Glu Ser Arg 

Glu Ser Lys Lys 
180 

Arg Lys Lys Ala 
195 

Lys Glu Ala Asn 
210 

Ala Lys Arg Gly 
225 

Asn Asp Ala Lys 


Ser Ser Ser Leu 
260 

Val Glu Glu Ala 
275 

Arg Asn Tyr Pro 
290 

Glu Ser Asp Val 
305 

Glu Ala Lys Glu 


Lys Val Glu Ser 
340 

Thr Asp Arg Asp 
355 


40 

Asn lie Lys Leu 
55 

yal Leu Glu Glu 
70 

Lys Leu Asp Ala 
85 

Gly Glu Lys Val 


Lys Ala Glu Asp 
120 

Thr Tyr Lys Thr 
135 

Lys Glu Ala Glu 
150 

Asn Glu Gly Thr 
165 

Ala Glu Ala Thr 


Glu Glu Glu Ala 
200 

Val Ala Thr Ser 
215 

Val Pro Gly Glu 
230 

Ser Ser Asp Ser 
245 

Lys Ser Gly Lys 


Glu Lys Lys Ala 
280 

Thr Asn Thr Tyr 
295 

Lys Val Lys Glu 
310 

Pro Arg Asp Glu 
325 

Lys Lys Ala Glu 

Asp Ala Glu Glu 
360 


Ser Ala lie Lys 
60 

Lys Ser Lys Asp 
75 

Ala Phe Glu Lys 
90 

Ala Glu Ala Lys 
105 

Gin Lys Glu Glu 


Leu Glu Leu Glu 
140 

Leu Glu Leu Val 
155 

lie Lys Gin Ala 
170 

Arg Leu Glu Asn 
185 

Lys Arg Lys Ala 


Asp Gin Gly Lys 
220 

Leu Ala Thr Pro 
235 

Ser Val Gly Glu 
250 

Lys Val Ala Glu 
265 

Lys Asp Gin Lys 


Lys Thr Leu Asp 
300 

Ala Glu Leu Glu 
315 

Glu Lys lie Lys 
330 

Ala Thr Arg Leu 
345 

Glu Ala Lys Arg 


45 

Thr Lys Tyr Leu 


Glu Leu Pro Ser 
80 

Phe Lys Lys Asp 
95 

Lys Lys Val Glu 
110 

Asp Arg Arg Asn 
125 

lie Ala Glu Phe 


Lys Glu Glu Ala 
160 

Lys Glu Lys Val 
175 

lie Lys Thr Asp 
190 

Asp Ala Lys Leu 
205 

Pro Lys Gly Arg 


Asp Lys Lys Glu 
240 

Glu Thr Leu Pro 
255 

Ala Glu Lys Lys 
270 

Glu Glu Asp Arg 
285 

Leu Glu He Ala 


Leu Val Lys Glu 
320 

Gin Ala Lys Ala 
335 

Glu Asn lie Lys 
350 

Lys Ala Ala Glu 
365 
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Glu 

Asp 

370 

Lys 

Val 

Lys 

Glu 

Lys 

375 

Pro 

Ala 

Glu 

Gin 

Pro 

380 

Gin 

Pro 

Ala 

Pro 

Ala 

385 

Thr 

Gin 

Pro 

Glu 

Lys 

390 

Pro 

Ala 

Pro 

Lys 

Pro 
3 95 

Glu 

Lys 

Pro 

Ala 

Glu 

400 

Gin 

Pro 

Lys 

Ala 

Glu 

405 

Lys 

Thr 

Asp 

Asp 

Gin 

410 

Gin 

Ala 

Glu 

Glu 




< 210 > 11 
<211> 425 
<212> PRT 

<213 > Artificial Sequence 
< 220 > 

Streptococcus^pneumoniae 36 '^ 1 ^ 06 deriv * d from a cDNA f rom the genome of 
<400> 11 

Thr Olu w Mu V,1 Thu Thr Gl» Vel *1. Thr Ser Ser Asn Arg Al, 


10 


15 


Asn Glu Ser Gin Ala Gly His Arg Lys Ala Ala 


20 


25 


Glu Gin Plie Asp Glu 


30 


Tyr lie Lye Thu Met He Glu Leu Asp Arg Arg Lye Hie Thr Glu Aeu 

40 45 

Pfae Ala Leu Asn lie Lys Leu Ser Arg lie Lys Thr Glu Tyr Leu Arg 


60 


Lys Leu Asn Val Leu Glu Glu Lys Ser Lys Ala Glu 


70 


75 


Leu Pro Ser Glu 
80 

Thr Ly. Lye Glu lie Asp Al. Ale The Glu Glu Phe Lye Lys Asp Thr 
85 90 95 

Arg Thr Lys Lys Thr Val Ala Glu Ala Glu Lys Lys Val Glu Glu 

105 110 

Ala Lys Lys Lys Ala Lys Ala Gin Lys Glu Glu Asp His Arg Asn Tyr 

120 125 

Pro Thr Aeu Thr Tyr Lye Thr Leu Glu Leu Glu He Ale Glu Her Asp 

135 140 

Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys 

150 15 5 160 

Glu Ser Arg Asp Asp Glu Lys He Lys Gin Ala Glu Ala Lys Val Glu 
165 170 175 

Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn lie Lys Thr Asp Arg 

185 190 

Glu Lys Ala Glu Glu Glu Ala Lys Arg Arg Ala Glu Ala Lys Leu Lys 

200 205 

G!u Ala Val Glu Lys Asn Val Ala Thr Ser Glu Gin Asp Lys Pro Lys 


220 
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Gly Arg Arg Lys Arg Gly Val Pro Gly Glu Gin Ala Thr Pro Asp Lys 
225 230 23S 

Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Ala 

2 ‘-it? r\ 2 D D 


245 


250 


Leu Pro 


Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu 


270 


260 265 

Lvs Lys Val Ala Glu Ala Glu Lys Lys Ala Lys Ala Gin Lys Glu Glu 
1 J 285 


275 


280 


Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu 
290 295 300 

He Ala Glu Ser Asp Val Lys Val Lys Glu Ser Glu Leu Glu Leu Val 
305 310 315 

Glu Glu Ala Lys Glu Ser Arg Asn Glu Glu Lys Val Asn Gin Ala 


Lys 


335 


325 330 

Lys Ala Lys Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Lys 

1 ^ ac 350 


340 3 45 

lie Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala 


355 


360 


365 


Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro 

375 380 


370 


Ala Pro Ala Pro Gin Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro 


385 


390 


395 


400 


Ala Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gin Pro Lys Ala Glu 
405 410 415 

Lys Thr Asp Asp Gin Gin Ala Glu Glu 
420 425 


< 210 > 12 
<211> 426 
<212> PRT 

<213 > Artificial Sequence 

<223> Description: Amino acid sequence derived from a cDNA from the genome of 
Streptococcus pneumoniae 

Thr° Glu . 2 Lys Glu Val Thr Thr Gin Val Ala Thr Ser Ser Asn Lys Ala 
1 5 . 10 15 

Asn Lys Ser Gin Thr Glu His Met Lys Ala Ala Lys Gin Val Asp Glu 
20 25 30 

Tyr lie Lys Lys Lys He Gin Leu Asp Arg Arg Lys His Thr Gin Asn 

35 40 45 

Val Gly Leu Leu Thr Lys Leu Gly Val lie Lys Thr Glu Tyr Leu His 

J c. n 
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Gly Leu Ser Val Ser Lys Lys Lys Ser Glu Ala Glu Leu Pro Ser Glu 


70 


75 


80 


lie Lys Ala Lys Leu Asp Ala Ala Phe 


Glu Gin Phe Lys Lys Asp Thr 
85 90 95 


Leu Pro Thr Glu Pro Gly Lys Lys Val Ala 


100 


Glu Ala Glu Lys Lys Val 
105 110 


Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Lys Asp Leu Arg 

11b -i -» 3 


120 


125 


Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Asp He Ala Glu 


135 


140 


Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu 


150 


155 


180 


Ala Lys Glu Ser Arg Asp Glu Lys Lys lie Asn Gin Ala Lys Ala Lys 
165 170 175 

Val Glu Asn Lys Lys Ala Glu Ala Thr Arg Leu Lys Asn lie Lys Thr 
180 185 190 

Asp Arg Glu Lys Ala Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Leu 
195 200 205 

Gin Glu Ala Asn Val Ala Thr Ser Glu Gin Asp Lys Ser Lys Arg Arg 

215 220 

Ala Lys Arg Glu Val Leu Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 

230 235 240 

Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Thr 
245 250 255 

Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys 


265 


270 


Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Glu 


280 


285 


Asp Arg 


Arg Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu lie 


295 


Ala 


300 


Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu 

310 215 32o 

Glu Ala Lys Glu Ser Arg Asn Glu- Glu Lys lie Lys Gin Val Lys Ala 

325 330 335 

Lys Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn He 


340 


345 


350 


Lys 


Thr Asp Arg Lys Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala 


360 


365 


Giu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala 


375 


380 


Pro Ala Pro Gin Pro Glu Lys Pro Thr 


385 


390 


Glu Glu Pro Glu Asn Pro Ala 


395 


400 
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Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Lys Pro Lys Ala 
405 410 415 


Glu Lys Pro Ala Asp Gin Gin Ala Glu Glu 
420 425 


<210> 13 
<211> 425 
<212> PRT 

<213 > Artificial Sequence 


< 220 > 

<223> 


Description: Amino acid sequence derived from a cDNA from the genome of 
Streptococcus pneumoniae 


<400=> 13 

Thr Glu Lys Glu Val Thr Thr Gin 
1 5 

Asn Lys Ser Gin Thr Glu His Met 
20 

Tyr lie Lys Lys Lys Leu Gin Leu 
35 40 

Val Gly Leu Leu Thr Lys Leu Gly 
50 55 

Gly Leu Ser Val Ser Lys Lys Lys 
65 70 

lie Lys Ala Lys Leu Asp Ala Ala 
85 

Leu Pro Thr Glu Pro Gly Lys Lys 
100 

Glu Glu Ala Lys Lys Lys Ala Glu 
115 120 

Asn Tyr Pro Thr Asn Thr Tyr Lys 
130 125 

Ser Asp Val Glu Val Lys Lys Ala 
145 150 

Ala Lys Glu Ser Arg Asp Glu Lys 
165 

Val Glu Asn Lys Lys Ala Glu Ala 
180 

Asp Arg Glu Lys Ala Glu Glu Ala 
195 200 

Gin Glu Ala Asn Val Ala Thr Ser 
210 215 

Ala Lys Arg Glu Val Phe Gly Glu 
225 230 


Val Ala Thr Ser Ser Asn Lys Ala 
10 15 

Lys Ala Ala Lys Gin Val Asp Glu 
25 30 

Asp Arg Arg Lys His Thr Gin Asn 
45 

Val He Lys Thr Glu Tyr Leu His 
60 

Ser Glu Ala Glu Leu Pro Ser Glu 
75 80 

Phe Glu Gin Phe Lys Lys Asp Thr 
90 95 

Val Ala Glu Ala Glu Lys Lys Val 
105 110 

Asp Gin Lys Glu Lys Asp Leu Arg 
125 

Thr Leu Glu Leu Asp lie Ala Glu 
140 

Glu Leu Glu Leu Val Lys Glu Glu 
155 160 

Lys lie Asn Gin Ala Lys Ala Lys 
170 I 75 

Thr Arg Leu Lys Asn lie Lys Thr 
185 190 

Lys Arg Arg Ala Asp Ala Lys Leu 
205 

Glu Gin Asp Lys Ser Lys Arg Arg 
220 

Leu Ala Thr Pro Asp Lys Lys Glu 
235 240 
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Asn 

Asp 

Ala 

Lys 

Ser 

245 

Ser 

Asp 

Ser 

Ser 

Val 

250 

Gly 

Glu 

Glu 

Thr 

Leu 

255 

Thr 

Ser 

Pro 

Ser 

Leu 

260 

Lys 

Pro 

Glu 

Lys 

Lys 

265 

Val 

Ala 

Glu 

Ala 

Glu 

270 

Lys 

Lys 

Val 

Glu 

Glu 

275 

Ala 

Lys 

Lys 

Lys 

Ala 

280 

Glu 

Asp 

Gin 

Lys 

Glu 

285 

Glu 

Asp 

Arg 

Arg Asn 
290 

Tyr 

Pro 

Thr 

Asn 

Thr 

295 

Tyr 

Lys 

Thr 

Leu 

Glu 

300 

Leu 

Glu 

lie 

Ala 

Glu 

305 

Ser 

Asp 

Val 

Glu 

Val 

310 

Lys 

Lys 

Ala 

Glu 

Leu 

315 

Glu 

Leu 

Val 

Lys 

Glu 

320 

Glu 

Ala 

Lys 

Glu 

Ser 

325 

Arg Asn 

Glu 

Glu 

Lys 

330 

lie 

Lys 

Gin 

Val 

Lys 

335 

Ala 


LyS Val Glu ^ A la Glu Ala Thr Arg Leu Glu Asn lie Lys 

340 345 350 

Thr Asp Arg Lys Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala 
355 360 36S 

G!u Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala 


375 


380 


Pro Ala Pro Gin Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala 
385 390 395 4£J0 

Pro Ala Pro Ala Pro Lys Pro Glu 


Asn Pro Ala Glu Lys Pro Lys Ala 
405 410 415 


Glu Lys Pro Ala Asp Gin Gin Ala Glu 


420 


425 


<210> 14 
<211> 424 
<212> PRT 

< ^13> Artif icial Seguence 
< 220 > 

<223 > Description: Amino acid sequence derived from 
S t r ept ococcus pneumoni ae 


a cDNA from the genome of 


<400> 14 

Thr Glu Lys Glu Val Thr Thr Gin Val Ala Thr Ser Ser Asn Arg Ala 


10 


Asn Lys Ser Gin Thr Glu His Met Lys Ala Ala Lys Gin Val 


20 


Tyr He Lys Lys Lys Leu Gin Leu 
35 


25 


15 


Asp Glu 


30 


Asp Arg Arg Lys His Thr Gin Asn 
40 45 


Val Gly Leu Leu Thr Lys Leu Gly Val He Lys Thr Glu Tyr Leu His 

55 60 


Gly Leu Ser Val Ser Lys Lys Lys Ser Glu Ala Glu Leu Pro Ser Glu 

03 


70 


75 


80 


17 
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He Lys Ala Lys Leu Asp Ala Ala Phe Glu Gin Phe Lys Lys Asp Thr 


85 


90 


Leu Pro 


Thr Glu Pro Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val 


110 


100 105 
Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Lys Asp Leu Arg 


115 120 125 

Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Asp lie Ala Glu 
130 135 140 

Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu 


145 


150 


155 


Ala Lys Glu Ser Arg Asp Glu Lys Lys lie Asn Gin Ala Lys Ala Lys 

170 I 75 


165 


Val Glu Asn Lys Lys Ala Glu Ala Thr Arg Leu Lys Asn lie Lys Thr 
180 185 190 

Asp Arg Glu Lys Ala Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Leu 

!95 200 205 

Gin Glu Ala Asn Val Ala Thr Ser Glu Gin Asp Lys Ser Lys Arg Arg 


220 


210 215 

Ala Lys Arg Glu Val Leu Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 


225 


230 


235 


Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Thr 
245 250 255 

Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala Glu Lys Lys 
260 265 270 

Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu Glu Asp Arg 
275 280 285 

Arq Asn Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu lie Ala 
290 285 300 

Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu Val Lys Glu 

305 310 315 320 

Glu Ala Lys Glu Ser Arg Asn Glu Glu Lys lie Lys Gin Val Lys Ala 


325 


330 


Lys Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn He Lys 
340 345 350 

Thr Asp Arg Lys Lys Ala Glu Glu Glu Glu Ala Lys Arg Arg Ala Ala 
355 360 365 

Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala 
370 375 380 

Ala Pro Gin Pro Glu Lys Pro Thr Glu Glu Pro Glu Asn Pro Ala 


Pro 


395 


400 


385 390 

Pro Ala Pro Ala Pro Lys Pro Glu Asn Pro Ala Glu Lys Pro Lys Ala 


18 
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405 

Glu Lys Pro Ala Asp Gin Gin Ala 
420 


< 210 > 15 
< 211 > 419 

<212> PRT 

<213> Artificial Sequence 
< 220 > 

<223> t ro» , CDMA from che 

<400> 15 

Thr Glu Asn -Glu Arg Thr Thr Gin Val Pro Thr Ser Ser Asn Arg Gl y 

Lys Pro Glu Arg Arg Lys Ala Ala Glu Gin Phe Asp Glu Tyr lie Asn 

25 30 


PCT/US99/07680 


genome of 


Lys Met He Gin Leu Asp Lys Arg Lys 


35 


40 


His Thr Gin Asn Leu Ala Phe 


45 


Asn He Gin Leu Ser 


50 


Arg lie Lys Thr Glu Tyr Leu Asn Gly Leu Lys 


60 


Glu Lys Ser Glu Ala Glu Leu Pro Ser Lys 


70 


Ala Ala Phe Lys Gin Phe Lys Lys Asp Thr 


85 


90 


lie Lys Ala Glu Leu Asp 
75 80 

Leu Pro Thr Glu Pro Glu 
95 


LyS ^ ~ £ ““ “* «» - Ly. *1 Glu Glu Ala Glu Ly . Dys 

105 110 

V11 “* S "* *" ^ ^ «» Glu Glu Asp HI. 


125 


Arg Asn Tyr Pro Thr He Thr Tyr Lys Thr 


135 


Leu Asp Leu Glu lie Ala 


140 


145 ““ AaP Vil Si *** G1 “ “* G1 “ L~ Glu Leu V.1 Ly. Ly. 

1SS 160 
Glu Ala Asp Glu Her Arp As„ Glu Gly Thr lie Aen Glu Ala Lys Ala 

170 175 

Ly. V .1 G1U Her Glu Ly. Ala Glu Ala Thr Arg Leu Ly. Ly. He Ly. 

185 190 

Thr Asp Arg Glu Ly. Ala Glu Glu Glu Glu Ala Ly. Arg Arg Ala Asp 

^ U0 205 

Aia Lys Glu Gin Asp Glu Ser Lys Arg Arg Lys Ser Arg Gly Lys Arg 

220 

Gly Ala Leu Gly Glu Glu Ala Thr Pro Asp Ly. Ly. Glu A,u Asp Ala 

235 240 

Ly. Her ser A.p Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser 

19 
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Lys Pro Gly lys lys M Ala Glu Ala Gl« lys lys val Glu Glu 

260 - 265 

Ala ASP lays Lys Ala Lys Ala Gin Lys Glu Glu Asp Arg Arg Asn Tyr 
275 280 

Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Ser Asp 
290 295 300 

Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys 
305 310 

Glu Ser Arg Asn Glu Glu Lys He Lys Gin Ala Lys Ala Lys Val Glu 


Ser Lys Lys 


Ala Glu Ala Thr Arg Leu Glu Lys lie Lys Thr Asp Arg 


Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys 
355 36° 365 

Val Lys Glu Lys Pro Ala Glu Gin Pro Gin Pro Ala Pro Ala Pro Gin 
370 375 380 

Pro Glu Lys Pro Ala Glu Glu Pro Glu Asn Pro Val Pro Ala Pro Lys 
385 39° 395 

Pro Glu Asn Pro Ala Glu Gin Pro Lys Ala Glu Lys Pro Ala Asp Gin 
405 410 

Gin Ala Glu 


<210> 16 
<211> 414 
<212> PRT 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Amino acid sequence 
Streptococcus pneumoniae 


derived from a cDNA from the genome of 


^ < Gli 6 Asn Glu Gly Ser Thr Gin Ala Ala Thr Ser Ser Asn Met Ala 
I s 10 

Lys Thr Glu His Arg Lys Ala Ala Lys Gin Val Val Asp Glu Tyr lie 
20 25 

Glu Lys Met Leu Arg Glu He Gin Leu Asp Arg Arg Lys His Thr Gin 
35 40 45 

Asn Val Ala Leu Asn He Lys Leu Ser Ala He Lys Thr Lys Tyr Leu 
50 55 6 ° 

Arg Glu Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser 
65 70 

Glu He Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Glu Lys Lys Asp 
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85 30 95 

Thr Usu Lys Pro Gly G l„ Lys Val au Ala Lys ^ W1 slu 


Glu Ala Lys Lys Lys Ala Glu 


110 


115 


Asp Gin Lys Glu Glu Asp Arg Arg 


Asn 


125 


Phe 


Pro Thr Asn Tin* Tvr Lvc nr t 

13 0 17 jXr L Glu . Leu Glu He Ala Glu 

J:> 140 

tS V41 Lys Val Lya S ala G1 “ Leu Q1 “ - ~ «. Glu Al, 

155 - _ _ 

160 

ly * ° 1U S “ 419 JS Gly Tte “* G1 ” «* «« Glu Lys Val 

170 175 

Olu Sea Lys lye Al. Glu * 1 , Thr at „ Ao „ ^ Lys ^ 

b 190 

*** Ll " SS "* G1 “ % «*■ *>» **- «. *>P Ala Lys Leu 

^ U0 205 

L y s Glu Ala Asn Val Ala Thr Ser Asp Gin Gly Lys Pro Lys Gly Arg 

“ 220 

2 « LyS Ar9 Gly Val 230 017 G1U LeU Ala I?* Pro *SP Lys Glu 

235 240 

Asa Asp Ala Lys Ser Ser Asp Ser Sar Val Gly Glu Glu Thr Leu Pro 

250 255 

ser ser Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys 

270 

Glu Ala Glu Lys Lys Ala Lys Asp Gin Lys Glu Glu Asp Arg 

280 285 

ATS As„ lyr Pro Thr As „ ^ Lys ^ ^ A=p w 

300 4 

Clu ser ASP val Ly. Val Ly. Glu Ala Glu Leu Glu Leu Val Lys Glu 
Glu Ala Lys Glu Pro Arg Asp Glu Glu 


320 


325 


Lys lie Lys Gin Ala Lys Ala 


330 


335 


ly. Val Glu ser Lys Lys Ala Glu Ala Thr Arp Leu Glu Asn lie Lys 

5 350 

Thr Asp Arp Lys Ly. Al. Glu Glu Glu Al. Lys Arp Lys Ala Al. Glu 


Glu Asp Lys Val Lys Glu Lys Pro 


365 


370 


375 


Ala Glu Gin Pro Gin Pro Ala Pro 


380 


Ala Thr Gin Pro Glu Lys Pro Ala Pro Lys Pro Glu 


390 


395 


Lys Pro Ala Glu 


Gin Pro Lys Al. Glu Lys Thr Asp Asp Gin Gin Ala Glu Glu 


400 


410 


21 
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<210> 17 
<211> 412 
<212> PRT 

< 2i3> Artificial Sequence 


< 220 > 

<223> 


Description: Amino acid sequence derived from a 
Streptococcus pneumoniae 


cDNA from the genome of 


<400> 17 
Glu Gly val Arg 
l 

Asp He Ser Lys 
20 

He Leu Ser Glu 
35 

Thr Val Asn Leu 
50 

Glu Leu Asn 
65 

Lys He Lys Ala 


Thr Leu Pro Thr 
100 

Val Glu Glu Ala 
115 

Arg Asn Tyr Pro 
130 

Glu Ser Asp Val 
145 

Glu Ala Asp Glu 


Lys Val Glu Ser 
180 

Thr Asp Arg Glu 
195 

Lys Glu Gin Asp 
210 

Asp Phe Gly Glu 
225 

Ser Ser Asp Ser 


Lys Pro Gly Lys 
260 


Ser Glu Asn Asn 
5 

Lys Tyr Ala Asp 


He Gin Thr Asn 
40 

lie Asn Lys Leu 
55 

Val Leu Glu Asp 
70 

Glu Leu Asp Ala 
85 

Glu Pro Gly Lys 


Glu Lys Lys Ala 
120 

Thr lie Thr Tyr 
135 

Lys Val Lys Glu 
150 

Ser Arg Asn Glu 
165 

Glu Gin Ala Glu 


Lys Ala Glu Glu 
200 

Glu Ser Lys Arg 
215 

Pro Ala Thr Pro 
230 

Ser Val Gly Glu 
245 

Lys Val Ala Glu 


Pro Thr Val Thr 
10 

Glu Val Lys Ser 
25 

Leu Asp Arg Ser 


Gin Asp lie Lys 
60 

Lys Ser Lys Ala 
75 

Ala Phe Glu Gin 
90 

Lys Val Ala Glu 
105 

Lys Ala Gin Lys 


Lys Thr Leu Glu 
140 

Ala Glu Leu Glu 
155 

Gly Thr lie Asn 
170 

Ala Thr Arg Leu 
185 

Glu Ala Lys Arg 


Arg Lys Ser Arg 
220 

Asp Lys Lys Glu 
235 

Glu Thr Leu Pro 
250 

Ala Glu Lys Lys 
265 


Ser Ser Gly Gin 
15 

His Leu Glu Lys 
30 

Lys His '’lie Lys 
45 

Arg Thr Tyr Leu 


Glu Leu Pro Ser 
80 

Phe Lys Lys Asp 
95 

Ala Lys Lys Lys 
110 

Glu Glu Asp Tyr 
125 

Leu Glu He Ala 


Leu Val Lys Lys 
160 

Gin Ala Lys Ala 
175 

Lys Lys Xle Lys 
190 

Arg Ala Asp Ala 
205 

Val Lys Arg Gly 


Asn Asp Ala Lys 
240 

Ser Pro Ser Leu 
255 

Val Glu Glu Ala 
270 
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SS ^ G1 “ «• ASP «- £, Asu Tyr Pro 

Thr 290 Thr *** LyS Thr 295 Glu Leu Glu Ile Ala Glu Ser Asp Val 

" 300 

ss val Lys Lya ua ■«- «“ -<• ~ a Glu Lys Gly 

315 320 

S " ^ *“ ° 1U S "* ™ ^ “*> £ *« «• «u « »„ ser 

LyS LyS A1 * “S Thr ^ ^ ““ Ays lie Lys Tht ^ ^ 

*** 350 

^ Ala 355 G1U G1U ^ ^ ** Glu Glu Asp Lys Val 

365 

Lys «. Lys Pro Ale »„ Gln pro ^ ^ ^ ^ ^ ^ 


Glu Lys Pro Ala Pro Ala Pro Lys Pro 


380 


390 


Glu Asn Pro Ala Glu Gin Pro 


395 


Lys Ala Glu Lys Pro Ala Asp Gin Gin Ala Glu Glu 


400 


410 


405 

<210> 18 
<211> 406 
<212> PRT 

<213 > Artificial Sequence 
<220> 

<223> Description: Amino acid sequence derive r 

Streptococcus pneumoniae from a cDNA from the 


genome of 


<400> 18 

Thr Glu Asn Glu Gly Thr Thr Gin 
1 5 


Ala Pro Thr Ser Ser Asn Arg 


10 


Gly 


15 


Asn OX. S« Olu -as Glu His Her Lys Ale A1 . Lys Gln Asp oiu 
Tyr II. Glu Lys ». t Leu Glo HSU Asp Arg Arg „ ys 2 Gin ^ 

V.1 n, HSU Leu Thr Lys Leu Sly Ale 11. Lys Thr Glu Tyr Leu Ar, 

60 


Gly Leu 
65 

Ser 

Val 

Ser 

Lys 

70 

Glu 

Lys 

Ser 

Thr 

Ile 

Lys 

Glu 

Lys 

Leu 

85 

Thr 

Ala 

Ala 

Phe 

Lys 

90 

Leu 

Lys 

Pro 

Glu 

100 

Lys 

Lys 

Val 

Ala 

Glu 

105 

Ala 

Ala 

Lys 

Lys 

115 

Lys 

Ala 

Glu 

Asp 

Gln 

120 

Lys 

Glu 


75 


80 


95 


110 


125 


23 
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Pro Thr II. Thr Tyr Lys Thr Lau =1. Lau Glu II. Al, <*» 

130 135 

val Olu V.1 Lys Lys *1. Olu Lau =1“ ““ ™ ^ V * 1 Lys “ T 


Glu Pro Arg 


Asp Glu Glu Lys He Lys 


lie Lys Gin Ala Glu Ala Glu Val Glu 
inn IV 5 


Sar I*. Lys Ala Olu Al. Thu Arg Lau Ly. Ly, II. Lys Thr Asp Arg 


Glu Lys 


Ala Glu Glu Glu Ala Lys Arg Arg val Asp Ala Lys Glu Glu 


ASP Glu Sar Gar Ly. Arg Arg Lys Gar Arg v.l Lys Arg Gly Asp Lau 
210 215 

gly Glu Ota Ala Thr Pro Asp Lys Lys Glu Ash Asp Al. Lys Gar Gar 

225 230 

Asp Ser ser Val Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu Lys Pro 
245 2bU 

Gly Lys Ly. Val Al. Glu Ala Glu Lys Lys val Glu Glu Ala Asp Lys 

260 2feb 

Lys Ala Ly. Al. Ota Lys Glu Glu Asp Arg Arg Ash Tyr Pro Thr Ash 
275 280 

Thr Tyr Lys Thr Lau Glu Lau Glu II. Ala Glu Gar Asp V.l Glu V.l 

290 295 

Lys Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Pro Arg 
305 310 31 

Asn Glu Glu Lys Val Lys Gin Ala Lys Ala Glu Val Glu Ser Lys Lys 

325 33U 

Ala Glu Ala Thr Arg Leu Glu Lys He Lys Thr Asp Arg Lys Lys Ala 
340 345 

Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu 
355 360 

Lys Pro Al. Glu Glh Pro Lys Pro Al, Pro Al. Pro Gl» Pro Glu Lys 
370 375 

Pro Al, Pro Lys Pro Glu Ash Pro Al. Glu Glh Pro Ly, Ala Glu Lys 


Pro Ala Asp Gin Gin Ala 
405 


<210> 19 
<211> H4 
<212> PRT 

<213> Artificial Sequence 
< 220 > 


<223 > Description: Amino acid sequence 


derived from a cDNA from the genome 
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Streptococcus pneumoniae 
<400> 19 

Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys Lys Lys 
15 10 15 

Ala Glu Asp Gin Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr 
20 25 30 

Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys 
35 40 45 

Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser Arg Asn 
50 55 60 

Glu Glu Lys lie Lys Gin Ala Lys Ala Lys Val Glu Ser Lys Lys Ala 

65 70 75 80 

Glu Ala Thr Arg Leu Glu Lys lie Lys Thr Asp Arg Lys Lys Ala Glu 

85 90 95 

Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys 
100 105 110 

Pro Ala 

<210> 20 
<211> 1295 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: cDNA derived from the genome S. pneumoniae 
<400> 20 

acagagaagg aggtaactac cccagtagcc acttcttcta ataaggcaaa taaaagtcag 60 

acagaacata tgaaagctgc tgaacaagtc gatgaatata taaacaaaat gatccaatta 120 

gataaaagaa aacataccca aaatctcgcc ttaaacataa agttgagcgc aattaaaacg 180 

aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aaaaagaaga gttgacgtca 240 

aaaacaaaaa aagagataga cgcagctttt gagcagttta acaaagatac attgaaacca 300 

ggagaaaagg ttgaagaagc tgagaagaag gttgaagaag ctgagaaaaa agccaaggat 360 

caaaaagaag aagatcaccg taactaccca accattactt acaaaacgct tgaacttgaa 420 

attgctgagt ccgatgtgga agttaaaaaa gcggagcttg aactagtaaa agaggaagct 480 

^Gggaiictc gaaacgagga aaaaattaag aaagcaaaag cggaagttga gagtaaaaaa 540 

gctgaggcta caaagttaga agaaatcaag acagaacgta aaaaagcaga agaagaagct 600 

aaacgaaaag cagaagcaga agaagaagtt aaaaataaac taaagaagcg gacaaaacga 660 

ggagcttttg gagagccagc aacacctgat aaaaaagaaa atgatgcgaa gtcttcagat 720 

tctagcgtgg tgaagaaatc ttccaagccc atcctgaaat cagaaaaaaa agtagcagaa 780 

gctgagaaga aggttgcaga agctgagaag aaggttgcag aagctgaga^ aaaagccaag 840 

gatcaaaaag aagaagatcg ccgtaactac ccaaccaata cttacaaaac gcttgaactt 900 

gaaattgctg agtccgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 960 

gctaaggaac ctcaaaacga ggaaaaaatt aagcaagcaa aagcgaaagt tgagagtaaa 1020 

aaagctgagg ctacaaggtt agaaaaaatc aagacagatc gtaaaaaagc agaagaagct 1080 

aaacgaaaag tagcagaaga agataaagtt aaagaaaaac cagctgaaca accacaacca 1140 

gctcctgcac caaaaccagc gccggctcct caaccagaaa aaccagctga acaaccaaaa 1200 

gcagaaaaac cagctgatca acaagctgaa gaagactatg ctcgtagatc agaagaagaa 1260 

tataacccgc ttgacttaac agcaccggca aaagc 1295 

< 210 > 21 
<211> 755 

25 
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<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: cDNA from Streptococcus pneumoniae 
<400> 21 


acagagaacg agggaagtac ccaagcagcc acttcttcta atatggcaaa gacagaacat 60 

aggaaagctg ctaaacaagt cgtcgatgaa tatatagaaa aaatgttgag ggagattcaa 120 

ctagatagaa gaaaacatac ccaaaatgtc gccttaaaca taaagttgag cgcaattaaa 180 

acgaagtatt tgcgtgaatt aaatgtttta gaagagaagt cgaaagatga gttgccgtca 240 

gaaataaaag caaagttaga cgcagctttt gagaagttta aaaaagatac attgaaacca 300 

ggagaaaagg tagcagaagc taagaagaag gttgaagaag ctaagaaaaa agccgaggat 360 

caaaaagaag aagatcgtcg taactaccca accaatactt acaaaacgct tgaacttgaa 420 

* attgctgagt tcgatgtgaa agttaaagaa gcggagcttg aactagtaaa agaggaagct 480 

aaagaatctc gaaacgaggg cacaattaag caagcaaaag agaaagttga gagtaaaaaa 540 

gctgaggcta caaggttaga aaacatcaag acagatcgta aaaaagcaga agaagaagct 600 

aaacgaaaag cagcagaaga agataaagtt aaagaaaaac cagctgaaca accacaacca 660 

gcgccggcta ctcaaccaga aaaaccagct ccaaaaccag agaagccagc tgaacaacca 720 

aaagcagaaa aaacagatga tcaacaagct gaaga 755 


< 210 > 22 
<211> 1239 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: cDNA from Streptococcus pneumoniae 


<400> 22 

acagagaagg aggtaactac ccaagtaccc acttattcta atatggcaaa gacagaacat 60 
aggaaagctg ctaaacaagt cgtcgatgaa tatatagaaa aaatgttgag ggagattcaa 120 
ttagatagaa gaaaacatac ccaaaatttc gccttcaaca tgaagttgag cgcaattaaa 180 
acggagtatt tgtatggatt aaaagagaag tcggaagctg agttgccgtc agaagtaaaa 240 
gcaaagttag acgcagcttt tgagcagttt aaaaaagata cattgaaact aggagaaaag 300 
gtagcagaag ctgagaagaa ggttgcagaa gctgagaaaa aagccaaggc tcaaaaagaa 360 
gaagatcgcc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 420 
tccgatgtgg aagttaaaaa agcggagctt gaactattga aagaggaagc taaaactcga 480 
aacgaggaca caattaacca agcaaaagcg aaagttgaga gtaaaaaagc tgaggctaca 540 
aagttagaag aaatcaagac agatcgtaaa aaagcagaag aagaagctaa acgaaaagca 600 
gaagcagaag aagataaagt taaagataaa ctaaagaggc ggacaaaacg agcagttcct 660 
ggagagccag caacacctga taaaaaagaa aatgatgcga agtcttcaga ttctagcgta 720 
ggtgaagaaa ctcttccaag cccatccctg aaatcaggaa aaaaggtagc agaagctgag 780 
aagaaggttg cagaagctga gaaaaaagcc aaggatcaaa aagaagaaga tcgccgtaac 840 
tacccaacca atacttacaa aacgcttgac cttgaaattg ctgagtccga tgtgaaagtt' 900 
aaagaagcgg agcttgaact agtaaaagag gaagctaagg gatctcgaaa cgaggaaaaa 960 
attaaccaag caaaagcgga agttgagagt aaaaaagctg aggctacaag gctagaaaaa 1020 
atcaagacag atcgtaaaaa agcagaagaa gaagctaaac gaaaagcagc agaagaagat 1080 
aaagttaaag aaaaaccagc tgaacaacca caaccagcgc cggctcctca accagaaaaa 1140 
ccaactgaag agcctgagaa tccagctcca gctccaaaac cagagaagcc agctgaacaa 1200 
ccaaaagcag aaaaaacaga tgatcaacaa gctgaagaa 1239 


<21G> 23 
<211> 1338 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<213 > Description: cDNA from Streptococcus pneumoniae 
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<400> 23 

acagagaacg agggagctac 
gcagaacaag gagaacaacc 
gtcgaggaat atgtaaaaaa 
catacaatta ctgtagctct 
aaaatagttg aatcaacctc 
aaagtagatg aagctgtgtc 
tcttccacta aaccggaagc 
gaaaaggtag cagaagctaa 
aaagaagaag atcgtcgtaa 
gctgagtccg atgtggaagt 
gaacct cgag acaagcaaaa 
gaggctacaa ggttaaaaaa 
cgaagagcag atgctaaaga 
ggagagctag caacaccfcga 
ggtgaagaaa ctcttccaag 
aagaaggttg aagaagctaa 
tacccaacca atacttacaa 
aaaaaagcgg agcttgaact 
gttaagcaag caaaagcgga 
atcaagacag atcgtaaaaa 
aaagttaaag aaaaaccagc 
ccagctccag ctccaaaacc 
gatcaacaag ctgaagaa 


ccaagtaccc acttcttcta 
taaaaaactc gattcagaac 
aatagtgggt gagagctatg 
agttaacgag ttgaacaaca 
agaaagccaa ctacagatac 
taagtttgaa aaggactcat 
ttcagataca gcgaagccaa 
gaagaaggtt gaagaagttg 
ctacccaacc aattacttac 
taaaaaagcg gagcttgaac 
aattaagcaa gcagaagcgg 
aatcaagaca gatcgtgaag 
gcaaggtaaa ccaaaggggc 
taaaaaagaa aatgatgcga 
cccatccctg aaaccagaaa 
gaaaaaagcc gaggatcaaa 
aacgcttgaa cttgaaattg 
agtaaaagag gaagctaagg 
agttgagagt aaaaaagctg 
agcagaagaa gaagctaaac 
tgaacaacca caaccagcgc 
agagaatcca gctgaacaac 


atagggcaaa tgaaagtcag 
gagataaggc aaggaaagag 
caaaat caac t aaaaagcga 
ttaagaacga gtatttgaat 
tgatgatgga gagtcgatca 
cttcttcgtc aagttcagac 
acaagccgac agaaccagga 
agaaaaaagc caaggatcaa 
aaacgcttga acttgaaatt 
tagtaaaagt gaaagctaac 
aagttgagag taaacaagct 
aagcagaaga agaagctaaa 
ggccaaaacg aggagttcct 
agtcttcaga ttctagcgta 
aaaaggtagc agaagctgag 
aagaagaaga tcgccgtaac 
ctgagjccga tgtggaagtt 
aacctcgaaa cgaggaaaaa 
aggctacaag gttagaaaaa 
gaaaagcagc agaagaagat 
cggctccaaa aacagaaaaa 
caaaagcaga aaaaccagct 
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<210> 24 
<211> 1284 
<212> DMA 

<213 > Artificial Sequence 


< 220 > 

<223> Description: cDNA prepared from Streptococcus pneumoniae genome 


<400> 24 

gaaggggtta gaagtgggaa 
aagtatgctg atgaagtcga 
ttgaagaaag ttcaacatac 
acgaagtatt tgtatgaatt 
aaaacaaaag aaacaaaaga 
ttatcaacag aaccagaaaa 
aaaaaagccg aggatcaaaa 
acgcttgaac ttgaaattgc 
gtaaaagtga aagctaacga 
gttgagagta aacaagctga 
gctgaggcta caaggttaga 
aaagttaaag atgaaccaaa 
cctgataaaa aagaaaatga 
ccaagcccat ccctgaaacc 
gctaagaaaa aagccgagga 
tacaaaacgc ttgaacttga 
gaactagtaa aagaggaagc 
gcggaagttg agagtaaaca 
aaaaaagcag aagaagaagc 
ccagctgaac aaccacaacc 
gaaaaaccag ctccagctcc 
c cage t gate aacaagctga 


taactccacg gttacatcta 
gtcgcatcta caaagtatat 
ccaaaatgcc gacttcaaca 
aaatgtttta gaagagaagt 
agagttaacc gcagcttttg 
aaaggtagca gaagetaaga 
agaaaaagat cgccgtaact 
tgagteegat gtggaagtta 
acctcgagac gaggaaaaaa 
ggctacaagg ttaaaaaaaa 
aaacatcaag acagatcgtg 
gaagcggaca aaaegaggag 
tgegaagtet teagatteta 
agaaaaaaag gttgeagaag 
tcaaaaagaa gaagatcgtc 
aattgctgag tccgatgtgg 
taaggaacct egaaaegagg 
agetgagget acaaggttag 
taaacgaaaa gcagcagaag 
agcgccggct cctcaaccag 
aaaaccagag aatccagctg 
agaa 


gtgggcaaga tatategaag 
tgaaggatgt caataaaaat 
aaaagttgag caaaattaaa 
cggaagctga gttgaegtea 
agcagtttaa aaaagataca 
agaaggttga agaagetaag 
acccaaccat tacttacaaa 
aaaaagcgga gcttgaacta 
ttaagcaagc agaagegaaa 
tcaagacaga tcgtgaacaa 
aacaagcaga agaagaaget 
ttcttggaga gccagcaaca 
gcgtaggtga agaaactctt 
ctgagaagaa ggttgaagaa 
gtaactaccc aaccaatact 
aagttaaaaa ageggagett 
aaaaagttaa gcaagcaaaa 
aaaacatcaa gacagatcgt 
aag a t aaagt t aaagaaaaa 
aaaaaccagc tccaaaacca 
aacaac c aaa agcagaaaaa 


60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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900 

960 

1020 
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<210> 25 
<211> 658 


.9951 266A2_I_> 


27 



% 


WO 99/51266 


PCT/US99/07680 


<212> DNA 

<213> Artificial Sequence 


< 220 > 

<213 > Description: 


cDNA derived from genome of Streptococcus pneumoniae 


<400> 25 

gaaggggtta 

aagtatgctg 

ttgaaaaaag 

ccgaagtatt 

aaaccaaaga 

tcaacagaac 

aaagccgagg 

cttgaacttg 

aaagaggaag 

gagagtaaac 

gaagaagaag 


gaagtgggaa 

atgaagtcga 

ttcaacatac 

tgtatgaatt 

acaaaagaag 

cagaaaaaaa 

atcaaaaaga 

aaattgctga 

ctaaggaacc 

aagctgaggc 

ctaaacgaaa 


taactccacg 

gtcgcatcta 

ccaaaatgcc 

aaagtgttta 

agttaccgca 

ggtagcagaa 

aaaagatcgc 

gtccgatgtg 

tcgaaacgag 

tacaaggtta 

agcagcagaa 


gttacatcta gtgggcaaga tatatcgaag 60 
caaagtatat tgaaggatgt caataaaaat 120 
gacttcaaca aaaagttgag caaaattaaa 180 
gaagagaagt cggaagctga gttgacgtca 240 
gcttttgagc agtttaaaaa agatacatta 300 
gctaagaaga aggttgaaga agctaagaaa 360 
cgtaactacc caaccattac ttacaaaacg 420 
gaagttaaaa aagcggagct tgaactagta 480 
gaaaaagtta agcaagcaaa agcggaagtt 540 
gaaaaaatca agacagatcg taaaaaagca 600 
gaagataaag ttaaagaaaa accagctg 658 


<210> 26 
<211> 1338 
<212> DNA 

<213 > Artificial Sequence 


< 220 > 

<223 > Description: 


cDNA derived from genome of Streptococcus pneumoniae 


<400> 26 

acagagaacg agggagctac 
gcagaacaag gagaacaacc 
gtcgaggaat atgtaaaaaa 
catacaatta ctgtagctct 
aaaat ag 1 1 g aa t caac c t c 
aaagtagatg aagctgtgtc 
tcttccacta aaccggaagc 
gaaaaggtag cagaagctaa 
aaagaagaag atcgtcgtaa 
gctgagtccg atgtggaagt 
gaacctcgag acgagcaaaa 
gaggctacaa ggttaaaaaa 
cgaagagcag atgctaaaga 
ggagagctag caacacctga 
ggt gaagaaa c t c 1 1 c caag 
aagaaggttg aagaagctaa 
tacccaacca atacttacaa 
aaaaaagcgg agcttgaact 
gttaagcaag caaaagcgga 
atcaagacag atcgtaaaaa 
aaagttaaag aaaaaccagc 
ccagctccag ctccaaaacc 
gatcaacaag ctgaagaa 


ccaagtaccc acttcttcta 
taaaaaactc gattcagaac 
aatagtgggt gagagctatg 
agttaacgag ttgaacaaca 
agaaagccaa ctacagatac 
taagtttgaa aaggactcat 
ttcagataca gcgaagccaa 
gaagaagg 1 1 gaag aagc t g 
ctacccaacc attacttaca 
taaaaaagcg gagcttgaac 
aattaagcaa gcagaagcgg 
aatcaagaca gatcgtgaag 
gcaaggtaaa ccaaaggggc 
taaaaaagaa aatgatgcga 
cccatccctg aaaccagaaa 
gaaaaaagcc gaggatcaaa 
aacgcttgaa cttgaaattg 
agtaaaagag gaagctaagg 
agttgagagt aaaaaagctg 
agcagaagaa gaagctaaac 
tgaacaacca caaccagcgc 
agagaatcca gctgaacaac 


atagggcaaa tgaaagtcag 
gagataaggc aaggaaagag 
caaaatcaac taaaaagcga 
ttaagaacga gtatttgaat 
tgatgatgga gagtcgatca 
cttcttcgtc aagttcagac 
acaagccgac agaaccagga 
agaaaaaagc caaggatcaa 
aaacgcttga acttgaaatt 
tagtaaaagt gaaagctaac 
aagttgagag taaacaagct 
aagcagaaga agaagctaaa 
gggcaaaacg aggagttcct 
agtcttcaga ttctagcgta 
aaaaggtagc agaagctgag 
aagaagaaga tcgccgtaac 
ctgagtccga tgtggaagtt 
aacctcgaaa cgaggaaaaa 
aggctacaag gttagaaaaa 
gaaaagcagc agaagaagat 
cggctccaaa agcagaaaaa 
caaaagcaga aaaaccagct 
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180 
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300 

360 
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<210> 27 
<211> 1242 
<212> DNA 

<213 > Artificial Sequence 


< 220 > 

<223 > Description 


cDNA derived from genome of Streptococcus pneumoniae 


28 


BNSDOCID: <WO 9951266A2_t_> 



WO 99/51266 


PCT/US99/07680 


I 




<400> 27 

acagaaaacg aaggaagtac 
aggaaagctg ctaaacaagt 
ctagatagaa gaaaacatac 
acgaagtatt tgcgtgaatt 
gaaataaaag caaagttaga 
ggagaaaagg tagcagaagc 
caaaaagaag aagatcgtcg 
attgctgagt tcgatgtgaa 
aaagaatctc gaaacgaggg 
gctgaggcta caaggttaga 
aaacgaaaag cagatgctaa 
ccaaaggggc gggcaaaacg 
aatgatgcga agtcttcaga 
aaatcaggaa aaaaggtagc 
aaggatcaaa aagaagaaga 
cttgaaattg ctgagtccga 
gaagctaagg aacctcgaga 
aaaaaagctg aggctacaag 
gaagctaaac gaaaagcagc 
caaccagcgc cggctactca 
caaccaaaag cagaaaaaac 
<210> 28 
<211> 1275 
<212> DNA 


ccaagcagcc acttcttcta 
cgtcgatgaa tatatagaaa 
ccaaaatgtc gccttaaaca 
aaatgtttta gaagagaagt 
cgcagctttt gagaagttta 
taagaagaag gttgaagaag 
taactaccca accaatactt 
agt taaagaa gcggagcttg 
cacaattaag caagcaaaag 
aaacatcaag acagatcgta 
gttgaaggaa gctaatgtag 
aggagttcct ggagagctag 
ttctagcgta ggtgaagaaa 
agaagctgag aagaaggttg 
tcgccgtaac tacccaacca 
tgtgaaagtt aaagaagcgg 
cgaggaaaaa attaagcaag 
gttagaaaac atcaagacag 
agaagaagat aaagttaaag 
accagaaaaa ccagctccaa 
agatgatcaa caagctgaag 


atatggcaaa gacagaacat 
aaatgttgag ggagattcaa 
taaagttgag cgcaattaaa 
cgaaagatga gttgccgtca 
aaaaagatac attgaaacca 
ctaagaaaaa agccgaggat 
acaaaacgct tgaacttgaa 
aactagtaaa agaggaagct 
agaaagttga gagtaaaaaa 
aaaaagcaga agaagaagct 
cgacttcaga tcaaggtaaa 
caacacctga taaaaaagaa 
ctcttccaag ctcatccctg 
aagaagctga gaaaaaagcc 
atacttacaa aacgcttgac 
agcttgaact agtaaaagag 
caaaagcgaa agttgagagt 
atcgtaaaaa agcagaagaa 
aaaaaccagc tgaacaacca 
aaccagagaa gccagctgaa 
aa 


<213> Artificial Sequence 
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< 220 > 

<223> Description: cDNA derived from genome of Streptococcus pneumoniae 


<400> 28 

acagagaagg aggtaactac 
gcaggacata ggaaagctgc 
gatagaagaa aacataccca 
gagtatttgc gtaaattaaa 
acaaaaaaag agatagacgc 
aaaacggtag cagaagctga 
aaagaagaag atcaccgtaa 
gctgagtccg atgtggaagt 
gaatctcgag acgatgaaaa 
gaggctacaa ggttagaaaa 
cgaagagcag aagctaagtt 
gataaaccaa aggggcggag 
aaagaaaatg atgcgaagtc 
tccctgaaac cagaaaaaaa 
aaagccaagg ctcaaaaaga 
cttgaacttg aaattgctga 
aaagaggaag ctaaggaatc 
gagagtaaaa aagctgaggc 
gaagaagaag ctaaacgaaa 
caaccacaac cagcgccggc 
gctcccgcac caaaaccaga 
caacaagctg aagaa 


ccaagtagcc acttcttcta 
tgaacaattc gatgaatata 
aaatttcgcc ttaaacataa 
tgttttagaa gagaagtcga 
agcttttgag cagtttaaaa 
gaagaaggtt gaagaagcta 
ctacccaacc aatacttaca 
t aaaaaag eg gage 1 1 gaac 
aattaagcaa gcagaagega 
catcaagaca gatcgtgaaa 
gaaggaaget gttgaaaaga 
aaaacgagga gttcctggag 
ttcagattct agcgtaggtg 
ggttgcagaa getgagaaga 
agaagatege cgtaactacc 
gtccgatgtg aaagttaaag 
tegaaaegag gaaaaagtta 
tacaaggtta gaaaaaatca 
ageageagaa gaagataaag 
tcctcaacca gaaaaaccaa 
gaagccagct gaacaaccaa 


atagggcaaa tgaaagtcag 
taaaaacaat gatccaatta 
agttgagcag aattaaaacg 
aagctgagtt geegteagaa 
aagataccaa cagaaccaaa 
a^aaaaaagc caaggctcaa 
aaacgcttga aettgaaatt 
tagtaaaaga ggaagetaag 
aagttgagag taaaaaagct 
aageagaaga agaagetaaa 
atgtagegae ttcagagcaa 
agcaagcaac acctgataaa 
aagaagetet tccaagccca 
aggttgcaga agctgagaaa 
caaccaatac ttacaaaacg 
aageggaget tgaactagta 
atcaagcaaa agcgaaagtt 
agacagatcg taaaaaagca 
ttaaagaaaa accagctgaa 
ctgaagagcc tgagaatcca 
aagcagaaaa aacagatgat 
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<210> 29 
<211) 1278 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223> Description: cDNA derived from genome of Streptococcus pneumoniae 


BNSDOCID: <WO 9951266A2J_> 


29 



r 


WO 99/51266 


PCT/US99/07680 


<400> 29 

acagagaagg 

acagaacata 

gatagaagaa 

gagtatttgc 

ataaaagcaa 

ccaggaaaaa 

gatcaaaaag 

gacattgctg 

gctaaggaat 

aaagctgagg 

aaacgaagag 

tcaaagaggc 

aatgatgcga 

aaaccagaaa 

gaggat caaa 

cttgaaattg 

gaagctaagg 

aaaaaagctg 

gaagaagct a 

ccacaaccag 

ccagctccag 

gatcaacaag 


aggtaactac 

tgaaagctgc 

aacataccca 

atggattaag 

agttagacgc 

aggtagcaga 

aaaaagatct 

agtccgatgt 

ctcgagacga 

ctacaaggtt 

cagatgctaa 

gggcaaaacg 

agtcttcaga 

aaaaggtagc 

aagaagaaga 

ctgagtccga 

aatctcgaaa 

aggctacaag 

aacgaagagc 

cgccggctcc 

ctccaaaacc 

ctgaagaa 


ccaagtagcc 

taaacaagtc 

aaatgtcggc 

tgtttcaaaa 

agcttttgag 

agctgagaag 

ccgtaactac 

ggaagttaaa 

gaaaaaaatt 

aaaaaacatc 

gttgcaggaa 

agaagttctt 

ttctagcgta 

agaagctgag 

tcgtcgtaac 

tgtggaagtt 

cgaggaaaaa 

gctagaaaac 

agcagaagaa 

tcaaccagaa 

agagaatcca 


acttcttcta 

gatgaatata 

ttactcacaa 

aagaagtcgg 

cagtttaaaa 

aaggttgaag 

ccaaccaata 

aaagcggagc 

aatcaagcaa 

aagacagatc 

gctaatgtag 

ggagagctag 

ggtgaagaaa 

aagaaggttg 

tacccaacca 

aaaaaagcgg 

attaagcaag 

atcaagacag 

gataaagtta 

aaaccaactg 

gctgaaaaac 


ataaggcaaa 

taaaaaaaaa 

agttgggcgt 

aagctgagtt 

aagatacatt 

aagctaagaa 

cttacaaaac 

ttgaactagt 

aagcgaaagt 

gtgaaaaagc 

cgacttcaga 

caacacctga 

ctcttacaag 

aagaagctaa 

atacttacaa 

agcttgaact 

taaaagcgaa 

atcgtaaaaa 

aagaaaaacc 

aagagcctga 

caaaagcaga 


taaaagtcag 

60 

gctccaatta 

120 

aattaaaacg 

180 

gccgtcagaa 

240 

accaacagaa 

300 

aaaagccgag 

360 

gcttgaactt 

420 

aaaagaggaa 

480 

tgagaataaa 

540 

agaagaagct 

600 

gcaagataaa 

660 

taaaaaagaa 

720 

cccatccctg 

780 

gaaaaaagcc 

840 

aacgcttgaa 

900 

agt aaaagag 

960 

agttgagagt 

1020 

agcagaagaa 

1080 

agctgaacaa 

1140 

gaatccagct 

1200 

aaagccagct 

1260 


1278 


<210> 30 
<211> 1276 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: cDNA derived from genome of Streptocossus pneumoniae 


<400> 30 

acagagaagg aggtaactac 
acagaacata tgaaagctgc 
gatagaagaa aacataccca 
gagtatttgc atggattaag 
ataaaagcaa agttagacgc 
ccaggaaaaa aggtagcaga 
gatcaaaaag aaaaagatct 
gacattgctg agtccgatgt 
gctaaggaat ctcgagacga 
aaagctgagg ctacaaggtt 
aaacgaagag cagatgctaa 
tcaaagaggc gggcaaaacg 
aatgatgcga agtcttcaga 
aaaccagaaa aaaaggtagc 
gaggatcaaa aagaagaaga 
cttgaaattg ctgagtccga 
gaagctaagg aatctcgaaa 
aaaaaagctg aggctacaag 
gaagaagct a aacgaagagc 
ccacaaccag cgccggctcc 
ccagctccag ctccaaaacc 
gatcaacaag ctgaag 


ccaagtagcc acttcttcta 
taaacaagtc gatgaatata 
aaatgtcggc ttactcacaa 
tgtttcaaaa aagaagtcgg 
agcttttgag cagtttaaaa 
agctgagaag aaggttgaag 
ccgtaactac ccaaccaata 
ggaagttaaa aaagcggagc 
gaaaaaaatt aatcaagcaa 
aaaaaacatc aagacagatc 
gttgcaggaa gctaatgtag 
agaagttttt ggagagctag 
ttctagcgta ggtgaagaaa 
agaagctgag aagaaggttg 
tcgtcgtaac tacccaacca 
tgtggaagtt aaaaaagcgg 
cgaggaaaaa attaagcaag 
gctagaaaac atcaagacag 
agcagaagaa gataaagtta 
tcaaccagaa aaaccaactg 
agagaatcca gctgaaaaac 


ataaggcaaa taaaagtcag 
taaaaaaaaa gctccaatta 
agttgggcgt aattaaaacg 
aagctgagtt gccgtcagaa 
aagatacatt accaacagaa 
aagctaagaa aaaagccgag 
cttacaaaac gcttgaactt 
ttgaactagt aaaagaggaa 
aagcgaaagt tgagaataaa 
gtgaaaaagc agaagaagct 
cgacttcaga gcaagataaa 
caacacctga taaaaaagaa 
ctcttacaag cccatccctg 
aagaagctaa gaaaaaagcc 
atacttacaa aacgcttgaa 
agcttgaact agtaaaagag 
taaaagcgaa agttgagagt 
atcgtaaaaa agcagaagaa 
aagaaaaacc agctgaacaa 
aagagcctga gaatccagct 
caaaagcaga aaagccagct 


60 

120 

180 
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480 
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660 
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<210> 31 
<211> 1272 
<212> DNA 


30 


BN S DOC ID: <WO 9951266A2 I > 



WO 99/51266 


PCT/US99/07680 


.SDOC1D: <WO_ 


<213 > Artificial Sequence 


< 220 > 

<223 > Descritpion: cDNA derived from genome of Streptococcus pneumoniae 


<400> 31 

acagagaagg aggtaactac ccaagtagcc 
acagaacata tgaaagctgc taaacaagtc 
gatagaagaa aacataccca aaatgtcggc 
gagtatttgc atggattaag tgtttcaaaa 
ataaaagcaa agttagacgc agcttttgag 
ccaggtaaaa aggtagcaga agctgagaag 
gatcaaaaag aaaaagatct ccgtaactac 
gacattgctg agtccgatgt ggaagttaaa 
gctaaggaat ctcgagacga gaaaaaaatt 
aaagctgagg ctacaaggtt aaaaaacatc 
aaacgaagag cagatgctaa gttgcaggaa 
tcaaagaggc gggcaaaacg agaagttctt 
aatgatgcga agtcttcaga ttctagcgta 
aaaccagaaa aaaaggtagc agaagctgag 
gaggatcaaa aagaagaaga tcgtcgtaac 
cttgaaattg ctgagtccga tgtggaagtt 
gaagctaagg aatctcgaaa cgaggaaaaa 
aaaaaagctg aggctacaag gctagaaaac 
gaagaagcta aacgaagagc agcagaagaa 
ccacaaccag cgccggctcc tcaaccagaa 
ccagctccag ctccaaaacc agagaatcca 
gatcaacaag ct 


<210> 32 
<21X> 1258 
<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223 > Description: Coding strand 
pneumoniae 

<400> 32 

acagagaacg agagaactac ccaagtaccc 
aggaaagctg ctgaacaatt cgatgaatat 
aaacataccc aaaatttagc cttcaacata 
aatggattaa aagagaagtc ggaagctgag 
gcagctttta agcagtttaa aaaagataca 
gaagctgaga agaaggttga agaagctgag 
aaggctcaaa aagaagaaga tcaccgtaac 
cttgaaattg ctgagttcga tgtgaaagtt 
gaagctgacg aatctcgaaa cgagggcaca 
gaaaaagctg aggctacaag gttaaaaaaa 
gaagaagcta aacgaagagc agatgctaaa 
cggggaaaac gaggagct ct- . tggagagcaa 
aagtcttcag attctagcgt aggtgaagaa 
aaaaaggtag cagaagctga gaagaaggtt 
aaagaagaag atcgccgtaa ctacccaacc 
gctgagtccg atgtgaaagt taaagaagcg 
gaatctcgaa acgaggaaaa aattaagcaa 
gaggctacaa ggttagaaaa aatcaagaca 
cgaaaagcag cagaagaaga taaagttaaa 
ccggctcctc aaccagaaaa accagctgaa 
ccagagaatc cagctgaaca accaaaagca 


acttcttcta atagggcaaa taaaagtcag 60 

gatgaatata taaaaaaaaa gctccaatta 120 
ttactcacaa agttgggcgt aattaaaacg 180 
aagaagtcgg aagctgagtt gccgtcagaa 240 
cagtttaaaa aagatacatt accaacagaa 300 
aaggttgaag aagctaagaa aaaagccgag 360 
ccaaccaata cttacaaaac gcttgaactt 420 
aaagcggagc ttgaactagt aaaagaggaa 480 
aatcaagcaa aagcgaaagt tgagaataaa 540 
aagacagatc gtgaaaaagc agaagaagct €00 
gctaatgtag cgacttcaga gcaagataaa 660 
ggagagctag caacacctga taaaaaagaa 720 
ggtgaagaaa ctcttacaag cccatccctg 780 
aagaaggttg aagaagctaa gaaaaaagcc 840 
tacccaacca atacttacaa aacgcttgaa 900 
aaaaaagcgg agcttgaact agtaaaagag 960 
attaagcaag taaaagcgaa agttgagagt 1020 
atcaagacag atcgtaaaaa agcagaagaa 1080 
gataaagtta aagaaaaacc agctgaacaa 1140 
aaaccaactg aagagcctga gaatccagct 1200 
gctgaaaaac caaaagcaga aaagccagct 1260 

1272 


of cDNA derived from genome of Streptococcus 


acttcttcta ataggggaaa gccagaacgt 60 
ataaacaaaa tgatccaatt agataaaaga 120 
cagttgagca gaattaaaac ggagtatttg 180 
ttgccgtcaa aaataaaagc agagttagac 240 
ttaccaacag aaccagaaaa aaaagtagca 300 
aagaaggtag cagaagctaa gaaaaaagcc 360 
tacccaacca ttacttacaa aacgcttgac 420 
aaagaagcgg agcttgaact agtaaaaaag 480 
attaaccaag caaaagcgaa agttgagagt 540 
atcaagacag atcgtgaaaa agcagaagaa 600 
gagcaagatg aatcaaagag gcgaaagagt 660 
gcaacacctg ataaaaaaga aaatgatgcg 720 
actcttccaa gcccatccct gaaaccagga 780 
gaagaagctg ataaaaaagc caaggctcaa 840 
aatacttaca aaacgcttga acttgaaatt 900 
gagcttgaac tagtaaaaga ggaagctaag 960 
gcaaaagcga aagttgagag taaaaaagct 1020 
gatcgtaaaa aagcagaaga agaagctaaa 1080 
gaaaaaccag ctgaacaacc acaaccagcg 1140 
gagcctgaga atccagttcc agctccaaaa 1200 
gaaaaaccag ctgatcaaca agctgaag 1258 


,9951266A2_1_> 
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<210> 33 
<211> 1242 
< 212 > DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Coding strand of cDNA derived from genome of Streptococcus 
pneumoniae 

<400> 33 

acagagaacg agggaagtac ccaagcagcc acttcttcta atatggcaaa gacagaacat 
aggaaagctg ctaaacaagt cgtcgatgaa tatatagaaa aaatgttgag ggagattcaa 
ctagatagaa gaaaacatac ccaaaatgtc gccttaaaca taaagttgag cgcaattaaa 
acgaagtatt tgcgtgaatt taatgtttta gaagagaagt cgaaggatga gttgccgtca 
gaaataaaag caaagttaga cgcagctttt gagaagttta aaaaagatac attgaaacca 
ggagaaaagg tagcagaagc taagaagaag gttgaagaag ctaagaaaaa agccgaggat 
caaaaagaag aagatcgtcg taactaccca accaatactt acaaaacgct tgaacttgaa 
attgctgagt tcgatgtgaa agttaaagaa gcggagcttg aactagtaaa agaggaagct 
aaagaatctc gaaacgaggg cacaattaag caagcaaaag agaaagttga gagtaaaaaa 
gctgaggcta caaggttaga aaacatcaag acagatcgta aaaaagcaga agaagaagct 
aaacgaaaag cagatgctaa gttgaaggaa gctaatgtag cgacttcaga tcaaggtaaa 
ccaaaggggc gggcaaaacg aggagttcct ggagagctag caacacctga taaaaaagaa 
aatgatgcga agtcttcaga ttctagcgta ggtgaagaaa ctcttccaag ctcatccctg 
aaatcaggaa aaaaggtagc agaagctgag aagaaggttg aagaagctga gaaaaaagcc 
aaggatcaaa aagaagaaga tcgccgtaac tacccaacca atacttacaa aacgcttgac 
cttgaaattg ctgagtccga tgtgaaagtt aaagaagcgg agcttgaact agtaaaagag 
gaagctaagg aacctcgaga cgaggaaaaa attaagcaag caaaagcgaa agttgagagt 
aaaaaagctg aggctacaag gttagaaaac atcaagacag atcgtaaaaa agcagaagaa 
gaagctaaac gaaaagcagc agaagaagat aaagttaaag aaaaaccagc tgaacaacca 
caaccagcgc cggctactca accagaaaaa ccagctccaa aaccagagaa gccagctgaa 
caaccaaaag cagaaaaaac agatgatcaa caagctgaag aa 


<210> 34 
<211> 1236 
<212> DNA 

<213 > Artificial Sequence 
< 220 > 

<223 > Description: Coding strand from cDNA derived from genome of 

Streptococcus 

pneumoniae 

<400> 34 

gaaggggtta gaagtgagaa taaccccacg gttacatcta gtgggcaaga tatatcgaag 60 

aagtatgctg atgaagtcaa gtcacatcta gaaaaaatat tgagtgagat ccaaacaaat 120 

ttagatagaa gtaaacatat caaaactgta aatctaatta acaaattgca agacattaag 180 

agaacgtatt tgtatgaatt aaatgtttta gaagataagt cgaaagctga gttgccgtca 240 

aaaataaaag cagagttaga cgcagctttt gagcagttta aaaaagatac attaccaaca 300 

gaaccaggaa aaaaggtagc agaagctaag aagaaggttg aagaagctga gaaaaaagcc 360 

aaggctcaaa aagaagaaga ttaccgtaac tacccaacca ttacttacaa aacgcttgaa 420 

cttgaaattg ctgagtccga tgtgaaagtt aaagaagcgg agcttgaact agtaaaaaag 480 

gaagctgacg aatctcgaaa cgagggcaca attaaccaag caaaagcgaa agttgagagt 540 

gaacaagctg aggctacaag gttaaaaaaa atcaagacag atcgtgaaaa agcagaagaa 600 

gaagctaaac gaagagcaga tgctaaagag caagatgaat caaagaggcg aaagagtcgg 660 

gtaaaacgag gagattttgg agagccagca acacctgata aaaaagaaaa tgatgcgaag 720 

tcttcagatt ctagcgtagg tgaagaaact cttccaagcc catccctgaa accaggaaaa 780 

aaggtagcag aagctgagaa gaaggttgaa gaagctgaga aaaaagccaa ggatcaaaaa 840 

gaagaagatc accgtaacta cccaaccatt acttacaaaa cgcttgaact tgaaattgct 900 

gagtccgatg tggaagttaa aaaagcggag cttgaactag taaaagagga agctaaggga 960 

tctcgaaacg aggaaaaagt taagcaagca aaagcggaag ttgagagtaa aaaagctgag 1020 
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gctacaaggt tagaaaaaat caagacagat cgtaaaaaag cagaagaaga agctaaacga 1080 
aaagcagcag aagaagataa agttaaagaa aaaccagctg aacaaccaca accagcgccg 1140 
gctcctcaac cagaaaaacc agctccagct ccaaaaccag agaatccagc tgaacaacca 1200 
aaagcagaaa aaccagctga tcaacaagct gaagaa 1236 


<210> 35 
<211> 1218 
<212> DNA 

<213 > Artificial Sequence 


< 220 > 

<223 > Description: Coding strand from cDNA derived from 


Streptococcus 

pneumoniae 

<400> 35 

acagagaacg agggaactac ccaagcaccc 
gcagaacata tgaaagctgc taaacaagtc 
gatagaagaa aacataccca aaatgtcggc 
gagtatttgc gtggattaag tgtttcaaaa 
ataaaagaaa agttaaccgc agcttttaag 
aaaaaggtag cagaagctga gaagaaggta 
aaagaagaag atcgtcgtaa ctacccaacc 
gctgagtccg atgtggaagt taaaaaagcg 
gaacctcgag acgaggaaaa aattaagcaa 
gaggctacaa ggttaaaaaa aatcaagaca 
cgaagagtag atgctaaaga gcaagatgaa 
cgaggagatc ttggagagca agcaacacct 
gattctagcg taggtgaaga aactcttcca 
gcagaagctg agaagaaggt tgaagaagct 
gatcgccgta actacccaac caatacttac 
gatgtggaag ttaaaaaagc ggagcttgaa 
aacgaggaaa aagttaagca agcaaaagcg 
aggttagaaa aaatcaagac agatcgtaaa 
gcagaagaag ataaagttaa agaaaaacca 
caaccagaaa aaccagctcc aaaaccagag 
ccagctgatc aacaagct 


acttcttcta ataggggaaa tgaaagtcag 
gatgaatata tagaaaaaat gctccaatta 
ttactcacaa agttgggcgc aattaaaacg 
gagaagtcga cagctgagtt gccgtcagaa 
cagtttaaaa aagatacatt gaaaccagaa 
gcagaagcta agaaaaaagc cgaggatcaa 
attacttaca aaacgcttga acttgaaatt 
gagcttgaac tagtaaaagt gaaagctaac 
gcagaagcgg aagttgagag taaaaaagct 
gatcgtgaaa aagcagaaga agaagctaaa 
tcatcaaaga ggcgaaagag tcgggtaaaa 
gataaaaaag aaaatgatgc gaagtcttca 
agcccatccc tgaaaccagg aaaaaaggta 
gataaaaaag ccaaggctca aaaagaagaa 
aaaacgcttg aacttgaaat tgctgagtcc 
ctagtaaaag aggaagctaa ggaacctcga 
gaagttgaga gtaaaaaagc tgaggctaca 
aaagcagaag aagaagctaa acgaaaagca 
gctgaacaac caaaaccagc gccggctcct 
aatccagctg aacaaccaaa agcagaaaaa 


genome of 
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<210> 36 
< 211 > 102 
<212> PRT 

<213 > Artificial Sequence 
< 220 > 

<22 3 > Description: Amino acid sequence derived from consensus cDNA sequence 


<400> 36 


Thr 

1 

Glu 

Asn 

Glu 

Gly 

5 

Thr 

Thr 

Gin 

Asn 

Gin 

Thr 

Glu 

20 

His 

Arg 

Lys 

Ala 

lie 

Lys 

Lys 

35 

Met 

Leu 

Glu 

Gin 

Leu 

40 

Val 

Ala 

50 

Leu 

Asn 

He 

Lys 

Leu 

55 

Ser 

Glu 

Leu 

Asn 

Val 

Leu 

Glu 

Glu 

Lys 


Val 

Ala 

10 

Thr 

Ser 

Ser 

Asn 

Arg 

15 

Ala 

Ala 

25 

Lys 

Gin 

Val 

Val 

Asp 

30 

Glu 

Tyr 

Asp 

Arg 

Arg 

Lys 

His 

45 

Thr 

Gin 

Asn 

Ala 

lie 

Lys 

Thr 

60 

Glu Tyr 

Leu 

Arg 

Ser 

Lys 

Ala 

Glu 

Leu 

Pro 

Ser 

Glu 
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65 

lie Lys Ala 
Leu Lys Thr 


Lys Leu Asp Ala Ala Phe Glu Gin Phe Lys Lys Asp Thr 
85 90 95 

Glu Pro Gly 
100 


<210> 37 
<211> 55 
<212 > PRT 

<213 > Artificial Sequence 


< 220 > 

<223> 


Description: Amino acid sequence derived from a cDNA consensus sequence 
from the genome of Streptococcus pneumoniae 


Isp Ala Lys Leu Glu Ala Thr Ser Glu Gin Asp Lys Pro Lys Gly Arg 
1 5 

Ala Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 
20 25 

Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro 
35 40 45 

Ser Pro Ser Leu Lys Pro Glu 
SO 55 


<210> 38 
<211> 103 
<212 > PRT 

<213 > Artificial Sequence 


< 220 > 
<223 > 
CDNA 


Description: Amino acid sequence derived from consensus sequence of 

derived from the genome of Streptococcus pneumoniae 


Lys°Lys 8 Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Lys Lys Lys 

Ala Lys Asp Gin Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr He Thr 
20 25 

Tyr Lys Thr Leu Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys 
35 40 45 

Lys Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Ser Arg Asp 
50 55 60 

Glu Gly Lys lie Lys Gin Ala Lys Ala Lys Val Glu Ser Lys Lys Ala 
65 70 75 

Glu Ala Thr Arg Leu Lys Lys He Lys Thr Asp Arg Glu Lys Ala Glu 
85 90 95 


Glu Glu Ala Lys Arg Arg Ala 
100 
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